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Performance in Role-Playing Situations as Related to 
Leadership and Personality Measures * 


Ernest C. Tupes and A. Carp, Wright Air Development Center 
WALTER R. Borc, Utah State University 


INTRODUCTION 


Although role-playing situations have been used extensively as aids to 
personality diagnosis and psychotherapy relatively little attention seems to 
have been paid to their possibilities as selection devices, and even less to an 
evaluation of their validities. During World War II, the OSS utilized role- 
playing as one of their assessment techniques (7). In recent years several 
industries and government agencies have incorporated role-playing situations 
into their employee screening procedures; however the situations appear to 
be relatively unstandardized and with no follow-up data available (4, 6, 8). 
Kelly and Fiske (3) used role-playing behavior as a basis for making predic- 
tions of future success in clinical psychology and present both concurrent 
and predictive validity statistics. More recently, Suttell has described the 
development of a battery of 16 standardized role-playing situational perform- 
ance tests and presents data indicating their concurrent validity against 
officer training criteria (9). Mann (5) has summarized the status of role- 
playing research by stating that “Perhaps the most striking impression to be 
gained from a review of the experimental studies of role-playing is their 
scarcity.” 

The general purpose of the present study was to determine the predictive 
validity of performance in role-playing situations designed to measure officer 
potential for criteria of military performance and officer effectiveness obtained 
six months after the administration of the situations. A second objective of 
the research was to explore the relationships between performance on these 
role-playing situational tests and other performance tests which were also 
hypothesized to be measures of officer potential. A third purpose of the study 
was to determine some of the personality correlates of performance in role- 
playing situations. Since relatively little is known about various possible 
procedures for scoring role-playing situations, a final objective of the research 


1 The experimental work for this study was carried out under Air Force Personnel 
and Training Research Center in support of Project 7719, Task 17009. Permission is 
granted for reproduction, translation, publication, and use or disposal in whole or in part 
by or for the United States Government. 
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was to investigate scores obtained by two quite different scoring methods with 
respect to their intercorrelation, their relative reliability, and their relative 
correlations with other variables. 

The following specific null hypotheses were tested by this study: 

1. Scores on role-playing situations designed to measure officer potential 
are not related to later criteria of military performance and officer effective- 
ness. 

2. Role-playing scores are not related to scores on other performance tests 
designed to be measures of officer potential and small group leadership. 


3. Role-playing scores are not related to personality ratings rendered by 
peers. 


METHOD 


Subjects 


The data analyzed in the present investigation were gathered in connection 
with an assessment of all male members of USAF Officer Candidate School 
Classes 1955B and 1955C. The study is based on 125 male graduates of 
Class 55B and 96 male graduates of Class 55C for whom complete test data 
were available. Before entering OCS the subjects had been screened in such 
a way that all were at least high-school graduates with at least one year of 
enlisted military service. Eighty-five per cent of the group would fall in the 


upper 10 per cent of the general population with respect to general intelli- 
gence, and the majority were planning on a career in the Air Force. 


Procedure 


As the subjects in each class reported to the school (55B in late December 
1954 and 55C in March 1955) they were assigned to six-man teams, in such 
a way that no member of a team had been acquainted with any other team 
member prior to reporting to OCS. Fellow team members were in very close 
contact during the three-and-one-half day testing period and were isolated 
from other teams, except that they shared a barracks floor and ate with 
members of one other team. 

At the close of the testing period, the subjects were regrouped into OCS 
flights of about 20 candidates. Each flight was supervised by a Tactical 
Officer and by a corresponding flight of OCS upperclassmen. Members of a 
flight were quartered together and attended classes and participated in other 
types of training as a unit. In reforming the assessment groups into OCS 
flights, care was taken so that no two members of any assessment group 
were assigned to the same OCS flight. 
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The Role-playing Situation Tests 


Six of sixteen role-playing situations developed under an Air Force contract 
by the American Institute for Research (9) were adapted for the present 
study. Each of the six situations is an individual performance problem 
designed to simulate the job situation of Air Force junior officers. Each 
situation places the subject in a problem situation involving face-to-face 
contact with one or more military subordinates (played by OCS upperclass- 
men actors) with a limited amount of time (5 to 10 minutes) in which to 
reach some kind of a solution to the problem. For example, in Problem /, 
Acting on Leave Request, the subject is assigned the role of a Squadron 
Commanding Officer who has received leave requests from three pilot instruc- 
tors (Lt’s. Alfa, Baker, and Coca). Because of the anticipated student load 
only two can be spared. Thus, one request must be denied. The CO has called 
the instructors to his office and must make a decision and then notify the 
one whose request must be denied. He has 10 minutes to accomplish this. 
Effective officer behavior consisted in eliciting from each officer the reasons 
he wanted leave and in granting leave to the two who deserved it most. The 
other five role-playing situations were similar, and are described briefly below: 


Problem II, Conducting Reenlistment Interviews. The subject is assigned the role of base 
recruiting officer and must interview all airmen whose enlistments are about to expire. 
Two airmen, one highly qualified, the other borderline, are awaiting interviews. He 
has 5 minutes to interview the two. 

Problem III, Interview with Irate Mayor. The subject is assigned the role of adjutant of 
a training base. The Commanding Officer is on leave, and the adjutant is temporarily 
in charge. After a squadron party several airmen have gotten into trouble in the nearby 
town. The adjutant is faced with the mayor of the town who has come out to discuss 
the situation. The mayor acts in an aggressive, abusive, ineffective manner attempting 
to make the adjutant angry and lose his composure. 

Problem IV, Adjutant Reprimand. The subject is assigned the role of Captain Alpha, a 
squadron adjutant, who must deal with a clerk for his failure to comply with a specific 
request to type some orders. This has happened previously and Captain Alpha feels 
he should deal with the airman severely. The airman has some pertinent information 
and gripes which complicate the problem. 

Problem V, Promotion Problem. The subject is assigned the role of Major Beta, the 
Squadron Commanding Officer. Major Beta has just promoted Lt. Wood to Squadron 
Operations officer and is now talking with Lt. Field who had also been in line for 
the job. The Major does not know what Lt. Field thinks but believes he wanted the 
promotion himself. Actually, Lt. Field feels that a third man, Lt. Peterson, should 
have the promotion. Success in the problem consists of attempting to determine the 
position which Lt. Field holds prior to defending his decision on the promotion and 
of maintaining Lt. Field’s cooperation. 

Problem VI, Transfer Problem. The subject is assigned the role of Major Baker, newly 
assigned to a squadron as Squadron Commanding Officer. This squadron has been 
ineffective in the recent past. Headquarters has indicated that a few of the most in- 
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efficient officers may be transferred. The Major is interviewing these officers to decide 
which should be transferred and to attempt to locate the source of the inefficiency. 
Captain Jones, Squadron Armament officer, is the next to be interviewed. Successful 
solution to the problem requires the Major to discover that low squadron morale is the 


cause of the inefficiency and to decide not to transfer Captain Jones who is actually a 
fairly effective officer. 


Each problem was administered and observed by two individuals (an OCS 
staff officer and an OCS upperclassman). Each was provided with a score 
sheet, developed specifically for the particular problem, containing brief 
statements of critical behaviors each problem was designed to measure. As 
each behavior occurred, the observers so indicated by checking an appropriate 
box on the score sheet. Thus each subject was observed and scored on each 
problem by two individuals and his total score on all six problems was con- 
tributed to by twelve observers. The check list for Problem I is shown below 
to indicate the type of behaviors included. 


PROBLEM I—LEAVE REQUEST 


(Score only if and as behavior occurs. Score by marking cross (X) in 
appropriate blank) 
. Explains situation to Alfa. 
. Explains situation to Baker. 
Explains situation to Coca. 
. Requests needed information from Alfa at beginning of inter- 
view. 
. Requests needed information from Baker at beginning of inter- 
view. 
. Requests needed information from Coca at beginning of inter- 
view. 
. Makes decision on leave only after consulting all the men. 
. Attempts to determine if Alfa could postpone leave. 
ee 9. Attempts to determine if Coca could postpone leave. 
. Commends Alfa for his good work. 
. Gives each man a personal interview. 
. Gives leave to Baker and Coca. 


3. Explains reason for decision to man whose leave request is 
denied. 
. Expressed regret that leave could not be granted. 


Two other observers (OCS upperclassmen) were also assigned to each 
subject and observed him on all six problems. After the subject completed all 
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problems, these two observers independently rated him on a series of 21 
personality and other traits. 

Thus, the role-playing situations were the basis for two quite different 
types of scores. The behavior check lists provided scores which were essen- 
tially recordings of critical behaviors as they occurred. The personality ratings 
may be regarded as inferences concerning underlying traits based on observa- 
tions of these behaviors. Comparisons between these two types of measure 
are of interest since they, in effect, indicate whether the same situations when 
viewed in two different ways yield the same results. Is the sum of specific 
behaviors the same as the sum of inferences based on those specific behaviors? 
Given two equivalent groups of observers, is anything gained by permitting 
one group to make what inferences they can from specific behavior, or do 
the biases and inaccuracies of judgment of the raters result in less accuracy of 
the inferential scores than of the behavior scores? 

That the results of the study might be more readily generalizeable to the 
operational selection situation, an effort was made to design the role-playing 
situations (and all other assessment procedures) in the form in which they 
might be used by Air Force personnel. Subjects were not informed of the 
experimental nature of the study; in fact they were told that the results 
would become a part of “permanent records.” OCS staff administrators and 
OCS upperclass observers and actors were typical military personnel. No 
formal training was given on any of the assessment techniques except that 
received in a 30-minute briefing at which time instruction cards were dis- 
tributed. Nearly all actors and observers for the Class 55C assessment had 
been subjects in the Class 55B assessment; however no differences were 
found as a result of this factor. 


Development of the Role-playing Situation Scores 


Although the effective officer behaviors which the situations were designed 
to measure have typically been found to be rather highly intercorrelated, the 
behaviors as checked on the behavior check lists were found to be highly 
specific, with an average intercorrelation (phi coefficient) of about .05. The 
total score, obtained by summing all behaviors checked, was found to have 
an internal consistency * of only .25, which indicated either that several inde- 


2 As used in this paper, the term “internal consistency” refers to internal relationships 
among the items comprising a measure and is estimated by the Kuder Richardson 
Formula 20 modified for continuous distributions. Thus the above statement that the 
total score had an internal consistency of .25 means that the correlations among the items 
were such that the Kuder Richardson Formula 20 applied to their sum yielded a coefficient 
of .25. The term “reliability” as used in this paper consistently refers to the amount of 
agreement between two or more raters using the same scale, and is estimated by applying 
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pendent factors were present or that some of the behaviors should not be 
scored. Accordingly, some behavior items which were checked for more than 
95 per cent or less than 5 per cent of the subjects*itv’Class 55B were dis- 
carded, as were items which had approximately zero correlations with the 
total score. The remaining items were combined into a revised Behavior 
score which was found to have an internal consistency of .42 on Class 55B 
and .37 on Class 55C. The between-raters correlation of this revised score 
was found to be .81 which gives an estimated reliability of .90 for the 
Behavior score based upon check lists completed by two observers. The 
Behavior score is, presumably, a measure of the subject’s ability to behave 
effectively in officer situations. 

The ratings on the trait variables were analyzed in a similar manner. Cor- 
relations between raters, trait by trait, were found to average .25 for Class 
55B and .30 for 55C. Intercorrelations were obtained between the trait ratings 
summed across the two observers. Inspection of these intercorrelations indi- 
cated that most were higher than the rater-rater reliabilities of the separate 
traits, and it was obvious that only a few variables were actually being 
measured. A clustering process * was performed which resulted in three trait 
scores. The first of these was a general score, encompassing most of the 
original variables. This General Trait score presumably was a measure of 
the raters’ opinion of the subject’s over-all adequacy in the situation. The 
second score was the rating on the single variable Athletic Ability. The third 
score was a two-variable cluster identified as Assertiveness. The three trait 
scores had estimated reliabilities, based on the agreement between raters, of 
39, .31, and .34 respectively. 

Intercorrelations among the four scores finally based on the role-playing 
situations appear in Table 1. Intercorrelations for Class 55B are shown above 
the diagonal; those for 55C are below. The intercorrelations for Class 55C 
may be regarded as a cross-check upon the combined scores which were 
developed on Class 55B data. The fact that the pattern of intercorrelations is 
highly similar in the two classes suggests that the scores, and the situations 
themselves, may be somewhat more stable than would be indicated by the 
more direct estimates of reliability. 


the Spearman Brown Prophecy Formula to the correlation between raters, with “a” equal 
to the number of raters. Unless otherwise indicated, all correlations reported in this paper 
are product moment coefficients. 

8 The clusters were developed on data from Class 55B with Class 55C serving as a 
check upon the accuracy of the procedure. Variables were grouped into a cluster when 
the ratio of their intercorrelations to their reliabilities was such as to indicate that their 
reliable variance was largely common to both. A formula developed by Kelley and 
discussed in Gulliksen (2, pp. 351-354), was used for this purpose. 
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TABLE 1 


Intercorrelations among Role-playing Scores 


. Behavior score 
. General Trait score 
3. Athletic score 
. Assertiveness score 18 


Note: Product-moment correlations for Class 55B (N==125) are shown above the 
diagonal; those for Class 55C (N96) below. For Class 55B, r’s of .18 are significant 
at the .0S level, two-tailed test. For Class 55C, r’s of .20 are significant at the .05 level. 


Only the Behavior score and the General Trait score were used to test the 
hypotheses of the research. These two scores appear to measure more directly 
effective or correct behavior in the role-playing situations. The Athletic and 
Assertiveness scores appear to measure fairly specific variables which are not 
especially relevant to the hypotheses. Since these two scores do represent 
two aspects of role-playing behavior, however, their relationships with the 
dependent variables are presented in this paper as a matter of interest. 


Analysis of Data 


The three null hypotheses were tested by correlating the role-playing 


situation scores with scores on other, measures selected for their appropriate- 
ness. 


Hypothesis 1 was tested by computing product-moment correlations be- 
tween the role-playing Behavior and General Trait scores with the following 
criteria, each based on performance during OCS in the six months following 
the administration of the role-playing situations: 

1. The final OCS military grade is based on paired comparison ratings of 
the candidate’s observed performance in the military training portion of OCS 
plus his officer potential. Ratings by fellow flight members, by members of 
the upperclass flight which supervised him, and by his tactical officer are 
averaged. 

2. The final OCS academic grade is based upon achievement in the aca- 
demic portion of OCS training. Achievement in each academic course, as 
measured by objective examinations, is combined into an over-all composite. 

3. Distinguished graduate status is assigned to the upper 5 to 10 per cent 
of each class on military and academic grades. Distinguished versus regular 
graduate status is a dichotomous criterion and correlations with it are point- 
biserials. 


4. The final criterion was a simulated Officer Effectiveness Report (OER) 
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criterion. OER’s are completed on each Air Force officer by his supervisor at 
intervals of not more than one year, and are the primary basis for various 
personnel actions such as promotions, integration into the Regular Air Force, 
etc. An OER is comprised of ratings on six specific characteristics and a 
rating on the officer’s over-all value to the Air Force. The simulated OER 
scores were developed on the basis of previous research (10) and are a com- 
bination of final military grades, final academic grades, and peer ratings on 
certain personality traits obtained at the end of OCS. 

Each of the above measures has demonstrated significant correlations with 
later officer effectiveness with other samples of OCS graduates. The Simulated 
OER scores are especially appropriate as intermediate criteria of officer suc- 
cess since they correlated .53 (a value which becomes .75 when corrected for 
unreliability of the effectiveness reports) with later operational officer effec- 
tiveness reports (OERs) completed over periods covering from one to five 
years of active commissioned service. 

The second hypothesis was tested by computing product-moment correla- 
tions between the role-playing scores and the following: 

1. Impromptu Speech Score. Each subject was given 5 minutes to prepare 
a speech and 5 minutes to present it (he was not allowed to read it, although 
he could use notes). Two observers independently rated him on a number of 
check-list items (e.g., did he talk coherently; did he attempt to hide behind 
the podium, etc.), designed to measure the effectiveness of his presentation. 
The total Impromptu Speech score was the sum of the checks the subject 
received by the two observers. The reliability of the score, based on the 
agreement between the observers, is estimated to be .88. 

2. Leaderless Group Discussion. Each group of six subjects was given a 
topic (in case of all-out mobilization, should the Air Force rely upon the 
AFROTC program or a greatly expanded OCS as the primary source of new 
officers?) which they had 30 minutes to discuss and arrive at a conclusion. 
No group leader was designated. Three observers rated each member of the 
group on each of nine items, using a 5-point scale for each item. The items 
covered effective behaviors such as the effectiveness of each subject, relevance 


of his remarks, etc. The Leaderless Group Discussion score is based upon the 
sum of the ratings of each observer, summed across all three observers. The 
reliability of the LGD score, based on agreement between raters, is estimated 
to be .82. 


3. Project X Small-group Construction Test. This small-group leadership 
test consisted of 12 situational construction problems. These problems all 
required cooperation of team members in developing and carrying out a 
solution, and provided a situation in which leadership behavior could be 
displayed. In a typical problem, the team was placed in a prison compound 
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and required to escape across a moat and over a solid board fence. A ladder 
was provided but it was not long enough to reach from the edge of the moat 
to the top of the fence. Several short lengths of rope were also provided. 
Certain areas, such as the moat, were painted red and could not be touched. 
The problem was solved by holding the ladder at an angle over the moat with 
ropes. One team member climbed the ladder and jumped to the fence. Once 
over the fence, he discovered additional props which could be used to help 
the remainder of the team to escape. A time limit of 12 minutes of actual 
working time was imposed on each problem, which was not usually sufficient 
to complete the problem. Solution of the problem was not necessary, however, 
for the rating scales used were directed toward behavior of team members 
rather than solution of the problem. 

The problems were administered by OCS staff officers. Two officers were 
assigned to each team to administer the first six problems. At the end of six 
problems the officers were shifted, and the last six problems were administered 
by two other officers. Six upperclassmen were assigned to each team to act 
as raters. An upper-class rater was assigned to each individual in the team, 
and rated only the individual assigned to him for a given problem. At 
the end of each problem, raters were rotated. This system resulted in each 
team member’s being rated twice by each rater in the course of completing 
the 12 problems. The rating device employed was a behavior check list con- 
sisting of 11 categories of effective behavior such as ability to plan, ability to 
define the problem, etc. Raters checked the scale for the man they were 
assigned to observe each time he displayed one of the behaviors on the check 
list. These checks were summed across all 12 problems. 

In addition to the Behavior score rendered by the upperclassman observers, 
each of the four officers supervising the problem administration made inde- 
pendent general ratings based on their observations of the subjects during 
the problems. These ratings were combined with the Behavior check-list 
scores to obtain the Project X score used in this study. This score had an 
estimated reliability (based on agreement between raters) of .80. 

4. Physical Proficiency Score. This score is based upon three physical 
proficiency measures (1) and performance upon Project Y. The three physical 
proficiency tests consist of (a) the number of dips the subject could accom- 
plish on parallel bars, (b) the number of feet he could throw a basketball, 
and (c) his speed in running 150 yards, reversing his course after each 25 
yards. Project Y, which might best be described as an oversized jungle gym 
or vertical obstacle course, consists of 12 climbing problems of which only the 
6 most difficult were used. Each problem consists of a structure, from 10 to 
40 feet high, which can be climbed fairly easily but which must be descended 
in another more difficult manner. A description of one problem will indicate 
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the abilities required. Problem 11 is a 40-foot structure consisting of four 
poles set in the ground so that the top of each pole is farther out from the 
center of the structure than the bottom. At 10-foot intervals, square plat- 
forms are attached between the four poles. Because of the angles of the poles, 
the top platform is about 4 feet longer on each side than is the next lower 
platform, which in turn is 4 feet longer than is the one below. The subject’s 
task is to climb one of the poles and then descend from platform to platform 
by sliding over the edge and swinging in with sufficient momentum to land 
on the platform below. 

Two raters rated each subject on each problem, using a check list of 
behaviors (perseverance, estimating the situation, blocking, etc.): The Project 
Y score was the sum of the checks across all raters across all six problems. 
The Project Y scores and the scores on the three physical proficiency tests 
were combined to form the Physical Proficiency score. 

Hypothesis 3 was tested by computing product-moment correlations be- 
tween the role-playing situation scores and ratings on a series of 30 bipolar 
personality traits, combined on the basis of earlier samples into seven clusters 
and seven separate trait variables (10). These ratings were obtained twice 
during OCS. The first rating was made at the end of the assessment period 
when each subject was rated by his five teammates and by the six subjects in 
the group that had shared the same barracks area. The second set of ratings 
occurred at the end of OCS when each subject was rated by his 15 to 20 


fellow flight members. It should be remembered that the assessment groups 
had been reshuffled into flights so that none of the second raters had rated 
the same subject in the first rating. 


RESULTS 


The correlations appearing in Table 2 provide a test of the first hypothesis 

concerning the relationship between role-playing scores and indicators of 
officer effectiveness. All eight relevant correlations in that table are significant 
at or beyond the 5 per cent level, which forces the rejection of the null 
hypothesis. 
None of the correlations are high enough to provide very effective prediction; 
however, it is possible that a larger number of situations, along with other 
types of performance tests, may yield a composite score with fairly useful 
validity. It is of interest to note that the correlations of the Behavior score 
with the criteria are almost exactly equal to those obtained by Suttell (9) 
even though hers were concurrent rather than predictive validities. 

The second hypothesis is concerned with the relationships between the 
role-playing scores and scores on other performance tests, each of which 
presumably measures one or more aspects of officer effectiveness. 
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TABLE 2 
Predictive Validities of Role-playing Scores for OCS Performance Criteria 


Criteria 


Military Academic Distinguished Simulated 
Predictors grade grade graduation OER 


Behavior score Pe 
General Trdit, score 13 
Athletic score 05 
Assertiveness score : ; — .03 — .08 


Note: N= 221 male graduates of OCS Classes 55B and 55C. Product-moment correla- 
tions of .13 are significant at the .05 level, two-tailed test. 


Table 3 presents the correlations between the role-playing scores and four 
other performance tests, as well as the intercorrelations of the role-playing 
tests and of the other performance tests. The magnitude of the correlations 
is low; however five of the eight relevant correlations with the performance 
tests are significant at the 5 per cent level or better. It may be concluded that 
behavior on role-playing tests designed to measure effectiveness of officer 
behavior is related to performance on other situations also designed to measure 
officer potential. 

Examination of all the intercorrelations in Table 3 suggest: the presence 
of at least three relatively independent performance factors. There would 
appear to be a physical proficiency factor (Role-playing Athletic score, 
Project X, and the Physical Proficiency score); a group assertiveness factor 
(Role-playing Assertiveness score, Leaderless Group Discussion, and Project 


TABLE 3 


Intercorrelations of Role-playing Scores and Other Assessment Performance Tests 


Variable 


. RP Behavior 
RP General trait 
RP Athletic 

RP Assertiveness 


Leaderless group discussion 
. Impromptu speech 

. Project X 

. Physical proficiency 


NO & Ww 


Note: N= 221 male graduates of OCS Classes 55B and 55C. Product-moment correla- 
tions of .13 are significant at the .05 level, two-tailed test. 
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TABLE 4 
Correlations of Role-playing Scores with Peer Personality Ratings 


Role-playing Scores 
Behavior score General trait Athletic Assertiveness 
vs. rating vs. rating v. rating vs. rating 


Personality 
Variable 1 2 


Social maturity : : <a ll 
Conventionality ; .05 .03 
Attentiveness to people. A .02 .10 
Lack of emotionality .08 13 .07 .04 
Insistent orderliness .08 . 0! .10 .00 
Adaptability .03 .03 .08 .03 03 
Energetic 13 .14 .05 .03 


Note: N = 221 male graduates of OCS Classes 55B and 55C. Product-moment correla- 
tions of .13 are significant at the .05 level, two-tailed test. 


Rating 1 was obtained at the end of the assessment (3 days of acquaintanceship). 


Rating 2 was obtained at the end of OCS (6 months of acquaintanceship) with entirely 
different rating groups. 


X); and finally a verbal factor (Speech, Leaderless Group Discussion, and 
the Role-playing General Trait and Behavior scores). 

Hypothesis 3 is tested by the correlations in Table 4 between the role- 
playing scores and ratings on a variety of personality traits. An examination 
of this table indicates that the role-playing scores were correlated with a 
considerable number of the personality ratings. Twenty-one of the 48 coeffi- 
cients are significant at or beyond the 5 per cent level. These data seem to 
indicate that the null hypothesis must be rejected. It appears that subjects 
who are high on the role-playing scores are also those who receive the more 
favorable ratings on most of the personality traits. They are rated as possess- 
ing more effective intelligence, as being higher on general adjustment, and as 
being more determined, socially mature, conscientious, and energetic, and as 
being less neurotic. In general, it would seem that subjects who score high 
on the role-playing scores are better adjusted over-all and better “all-around” 
persons. 

The role-playing Assertiveness score which seems (see Table 3) to measure 
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the ability to make oneself heard in a group or to make the group take notice 
of one, has quite a different pattern of correlations with the personality ratings 
than the other role-playing scores. It correlates significantly positively with 
ratings of Assertiveness (suggesting that it has construct validity). However, 
its correlations with the other personality ratings shift from about zero with 
the ratings obtained after assessment to predominantly negative with the 
ratings obtained six months later. It would appear, tentatively, that the value 
systems of the candidates must have altered as they proceeded through OCS. 
During assessment with its aura of competition, peers were willing to tolerate 
the assertive aggressive individual (at least they did not ascribe “bad” per- 
sonality characteristics to him). During OCS, however, where a popular class 
slogan is “cooperate and graduate,” the assertive candidate was looked upon 
so unfavorably by his peers that they rated him at the “bad” end of most 
traits when rating after completion of the OCS program, 


Comparison of Behavioral and Trait Methods of Scoring Role-playing Situa- 
tions 


A final problem investigated by the present study was a comparison of two 
methods of scoring the role-playing situations. The first comparison was in 
terms of reliability. The Behavior score had an estimated reliability (based 
on the agreement of raters) of .90 which is significantly higher than the 
reliability of the General Trait score which was an unsatisfactorily low .39. 
There is no doubt that with respect to the criterion of sheer objectivity of 
scoring the Behavior scores are far superior. 

A second comparison can be made in terms of the extent to which the 
two scores measure the same thing. If they do, then of course the extra 
expense involved in obtaining the Trait scores is unjustified. The obtained 
correlation (see Table 1) of .28 between the two scores would seem to indi- 
cate very little overlap—in fact if the obtained correlation is corrected for 
unreliability of both variables the corrected correlation is only .47 suggesting 
that the variables have somewhat more specific than common reliable vari- 
ance. Examination of the correlations of the Behavior and General Trait 
scores with other variables (see Tables 2 through 4) suggests that their 
patterns of correlations are fairly similar indicating that the specific variances 
of the scores, however, may be contributing little to their correlations with 
other measures. A rho of .70 was, in fact, found between their correlation 
patterns, suggesting that a considerable part of their correlations with other 
variables may be attributed to their common variance, but that a consider- 
able part might also be attributed to their specific variances. That is, it 
appeared that the two scores were actually somewhat different and that a 
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combination of the two might lead to higher validity than would either one 
separately. To test this, the correlations with the OCS criteria of a unit- 
weighted composite of the two were computed. These correlations were found 
to be .23, .25, .23, and .25 for the four criteria of Military and Academic 
grades, Distinguished Graduation, and Simulated OERs. These values are 
only slightly higher than the validities of the Behavior score alone (see Table 
2), and lead to the conclusion that the expense of obtaining the Trait scores 
is not justified. It would appear then, to answer the questions raised in an 
earlier section of this paper, that behavior check lists do yield more accurate 
and reliable scores than do inferential ratings; that the latter do measure 
something different than the former; but that the inferential ratings add 
little if anything to prediction based on scores from objective check lists of 
specific behaviors. 


SUMMARY AND CONCLUSIONS 


In order to test certain hypotheses concerning relationships between role- 
playing behavior and other measures of behavior, correlations between scores 
obtained from the administration of six role-playing situations to 221 Officer 
Candidate School members, and a series of military performance criteria, 
performance tests, and personality ratings were computed. The findings lead 
to several conclusions: 


1. Scores based on role-playing situations are significantly related to later 


criteria of military performance such as achievement in military academic 
subjects, ratings of military proficiency, and estimated officer effectiveness. 
The relationships obtained were fairly low, however, and it would appear that 


the role-playing tests, if used alone as selection procedures, would not provide 
very efficient prediction. 

2. Significant relationships were found between role-playing scores and 
scores on other individual and group situational performance tests also de- 
signed to measure officer potential. The correlations were low, however, so 
that if the other performance tests are valid, the use of a combination of all 
might result in efficient prediction. 

3. It is also concluded that a significant relationship exists between role- 
playing behavior and personality as measured by peer ratings on a series of 
personality traits. It appeared that the more proficient subjects in the role- 
playing tests were also those who were seen by their peers as being better 
adjusted and as being better ‘‘all-around”’ persons. 

4. A final conclusion of the research is that the most effective method of 
scoring role-playing situations is the behavior check-list method wherein lists 
of the specific behaviors expected to be elicited by the situations are prepared 
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and used by observers who merely indicate whether each behavior was present 
or absent. 


Manuscript received: October 28, 1957 
Revised manuscript received: March 18, 1958 
Ernest C. Tupes 
Chief 
Selection and Classification Branch 
Personnel Laboratory 
Wright Aid Development Center 
Box 1557 
Lackland Air Force Base, Texas 
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The Dynamics of Group Psychotherapy 
as Seen by a Sociologist 


Part Il. Some Puzzling Questions on Leadership, 
Contextual Relations, and Outcomes 


DorotHy Fans Beck, Family Service Association of America‘ 


This is the second of two papers which together attempt to delineate the 
dynamics of group psychotherapy from a sociological viewpoint. The first 
paper attempted to sketch the role systems and normative patterns of group 
psychotherapy for noninstitutionalized adult neurotics and to suggest how 
and why, in sociological terms, these patterns accomplish therapeutic improve- 
ment. The beneficial results which occur under favorable circumstances were 
hypothesized to stem from the use of an artificially structured subculture in 
which the members were brought as rapidly as possible to observe, in those 
areas where change was the goal, a new set of norms that were the approximate 
inverse of those in which their maladaptive patterns had been learned. These 
new norms were seen to force reversals of behavior, thought, and emotions in 
trouble areas, while at the same time providing for their reception with non- 
condemnatory, neutral, or unexpectedly favorable responses. These, in turn, 
were visualized as leading participants to modify gradually their internalized, 
but outdated or inappropriate, expectations toward less restrictive images of 
responses. The peer group used in this manner was seen to provide a special- 
ized “reference group” where the particular deviant values necessary for the 
correction or neutralization of inappropriate or excessive anticipatory socializa- 
tion could be given support, sanction, and embodiment in interaction patterns. 

This earlier article delineated the special assets and liabilities of groups 
when used for this process. It left unanswered, however, because of space 
limitations, three further questions of fundamental interest to sociologists, 
namely: (a) How can the group members, at a time of life when personality 
is usually considered highly resistant to fundamental change, be brought to 
accept and follow a leader who, instead of expressing their basic norms, pro- 
motes their rapid reversal, especially in those areas most charged with emo- 
tion? (b) How can the group protect its part-time deviant subculture from 
outside pressures from an often unsympathetic, or even contradictory, cultural 
context? (c) How can the process succeed in facilitating an improved post- 
therapy adjustment to an external environment, even though it follows sub- 


1 Responsibility for the views expressed rests with the author and not with the Family 
Service Association of America. 
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stantially different norms from those taught within the therapeutic circle? 
The fact that therapy groups, when successful, actually accomplish these three 
exceptionally difficult tasks has led the author to refer to these achievements as 
“social miracles,” and has stimulated a search for explanations. Those which, 
on reflection, have seemed most plausible will be shared with readers in this 
second article. 


SOURCES OF ANALYST’S SUCCESS IN REVERSING PATIENTS’ NORMS 


Parsons has called the requirement of congruence between the internalized 
value patterns of group members and the norms of the social structure in 
which they function “the fundamental dynamic theorem of sociology” (6, p. 
42). By requiring that the members come, as rapidly as possible, to observe 
norms that, in relevant problem areas, are directly the inverse of those in 
which they have been socialized, the process strains to the limit this func- 
tional requirement of a social system. In addition, social research has sup- 
ported with concrete experimental evidence the hypothesis that leaders, to 
be acceptable to a group, must themselves accept the norms of those they wish 
to lead (4). If this were the pattern in group psychotherapy, the leader would 
have to accept the group’s internalized values and thus negate his thera- 
peutic effectiveness, or face rejection by his group or its rapid dissolution. 
How then is this radical reversal, this negation of supposedly stable internal- 


ized values, actually accomplished? The author suggests that at least the 
following eleven factors are jointly operative: 


1. Selection of Members for High Motivation 


Probably the first clue to the leader’s success in reversing the initial norms 
of members lies in the self-selection of participants for high motivation. Such 
selection is, in part, assured by a patterned sequential screening for the 
legitimation of new group members. At the very outset, the external require- 
ments, such as locating a suitable therapist in a scarce market, surmounting 
the absence of guidelines, accepting an initial screening interview, arranging 
to finance the usually high cost of continuing treatment, and actually initiating 
attendance, all serve to screen for motivation. Admittedly, survival in this 
early winnowing process is influenced also by social class and subcultural 
differences in knowledge of sources, ability to pay, and acceptance of the 
help-seeking pattern. However, because of the lesser cost of group methods, 
these external barriers should be less restricting than for individualized treat- 
ment. 

In addition, the preadmission process imposes three difficult subjective 
hurdles on all types of participants: (a) the conscious self-definition of per- 
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sonal involvement in emotional difficulties, (b) open admission of need for 
help (i.e. the inability to meet the socially patterned requirement that one 
“pull oneself together’), and (c) surrender of the protective defewse of self- 
isolation and hiding. All three hurdles involve confronting a reduced self- 
image and can be surmounted only when motivation is high. Those with low 
or ambivalent motivation, or subject to heavy cross-pressures, will seldom 
reach the group. This screening process is both inevitable and consistent with 
the widely verbalized requirement for successful treatment, namely that “the 
patient must want to get well.” To sociologists, it is of considerable interest 
that this formulation corresponds so closely to one of Parsons’ basic func- 
tional requirements for a social system, namely: that a sufficient proportion 
of the actors be adequately motivated to fulfill its role-system requirements 
(6, pp. 27-28). Since the role of the patient is difficult and cannot be sur- 
mounted by swallowing pills or submitting to tests or surgery, it is not sur- 
prising that this requirement has come to be recognized. 


2. Maximization of Hope 


To the push afforded by the patient’s suffering, the analyst adds what he 
can by maximizing the pull of hope. To this end, and in so far as is consistent 
with the therapist’s integrity, hope for substantial improvement is encouraged 
and qualifications as to the price in time, cost, and suffering are minimized. 
Since adequate motivation is basic, a bit of benevolent manipulation at this 
critical stage can actually increase the patient’s prospects while a too-honest 
sharing of the therapist’s knowledge of recovery chances and ultimate cost 
could actually be dysfunctional. It is perhaps because of the great and 
admittedly mutual rewards of manipulation at this initial stage, that a norma- 
tive control, namely the requirement that the ethical practitioner may not 
guarantee a cure, has evolved to protect all concerned from the consequences 
of excessive claims. 


3. Patterned Functional Requirements for Leaders 


Acceptance for the role of group leader requires in ethical practice that he 
qualify himself by extensive specialized education and a personal analysis to 
meet the socially defined standards of technical professional competence and 
personality. In return, he is rewarded with a legitimation of his status that 
yields him an initial prestige and upper-class halo that lead his patients to 
define him as an authority figure. In addition, his presumed powers to relieve 
suffering render him an exceptionally important authority figure. These 
multiple sources of prestige facilitate his otherwise exceptionally difficult role. 
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4. “Transference” Assets 


To these initial advantages is usually added a further asset, namely a 
sufficient personal liking of the analyst so that, in spite of fears of an unknown 
process, members have not rejected the leader in the screening interview. 
Although part of this liking probably has a rational reality basis, it is com- 
monly referred to as “positive transference.” In the case of the therapeutic 
group, this liking is supplemented by similar or more diluted positive feelings 
for other selected group members. This configuration of positive affect renders 
the leader a symbol around which the group can focus, heightens his power 
to guide the difficult tasks ahead, and helps him maintain group cohesion and 
morale. It has long been considered therapeutically inadvisable to “interpret” 
the basis of this “positive transference” during the early stages of therapy. 
In the present conceptual framework the reason becomes obvious. If the 
analyst is to achieve his almost impossible task of group-norm reversal, he 
requires that additional leverage that can only come when the irrational 
aspects of the “positive transference” remain at least to some extent latent or 
unaware to participants. The balance is further tipped in the desired direction 
by careful attention to the analysis, or making manifest, of resistances. 


5. Use of Gradients in the Introduction of the Norm-Reversal Requirement 


At the outset members are usually only vaguely aware, if at all, of the 
devastatingly unconventional ethic that is to characterize their future therapy. 
In other words, they typically begin with a patterned distortion of both their 
role and situational definitions. As a result, they cannot mobilize their defenses 
to reject the process in toto. The leader intentionally explains the requirements 
only briefly, gradually, and in unthreatening language, duly modulating the 
speed and extent of his insistence on compliance to fit the tolerance limits 
of members. Compliance is rewarded with attention, interest, and implicit if 
not overt approval, each the more effective because of the analyst’s differential 
prestige and power. Willingness to confide emerges as a symbol of participa- 
tion, while nonparticipation, resistance, absences, and other patterned devi- 
ancies become in time highly visible by reason of group comment, focus, and 
interpretation. Outlets for the hostility aroused by the difficult tasks are 
drained off through the tradition that it is permissible to attack or criticize 
the therapist, or to undertake to play the role of analyst and comment on the 
deficiencies of both therapist and participants. These outlets, plus opportuni- 
ties for members to meet without the leader present, serve to keep the hos- 
tility engendered within manageable limits. As resistances and cross-pressures 
are relaxed, the new norms are tightened up. Ultimately, the group comes to 
assume, on its own, responsibility for enforcing the new norms. 
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This use of gradients is an essential adaptation since most participants 
would at the outset not be able to comply with the new requirements. Pre- 
mature insistence could lead to total break-up. Moreover, the gradual ap- 
proach provides the opportunity to experience and make manifest the latent 
barriers to compliance, so that, one by one, they can be relaxed rather than 
repressed. On the other hand, the gradient method has the patterned dis- 
advantage that patients cannot be told fully in advance what to expect. 
Inevitably, therefore, disappointments and crises arise, especially when a 
quick and painless recovery has been anticipated. Perhaps the so-called 
negative therapeutic reaction, which often follows the early “honeymoon’”’ 
period of high hopes, is an unavoidable price inherent in the use of gradients 
and the early maximization of hope. In any case, it is, though unintended, 
no longer unanticipated by therapists. In the group setting, this rebound 
reaction can perhaps be made less dévastating since progress in others is 
often already visible and peers in whom the patient has confidence can assist 
in reinterpreting early unrealistic expectations. 


6. Leader’s Differential Access to Information Channels 


In modulating requirements to the group tolerance level, the therapist has 
the advantage of greater knowledge than any possible opponent about the 
normative background of members. To information from screening interviews, 
incidental conversations, and occasional or regular private sessions with mem- 
bers, he adds his own deductions from observations, more possible to him than 
others because of his greater awareness, sensitivity, and freedom from emo- 
tional involvement. He is, therefore, in a strategic position and can encourage 
the less inhibited to set a “good” example. Once the new role patterns have 
been demonstrated, he can utilize rivalry, competition, relational rewards, and 
sanctions to keep the process moving. Anticipating trouble spots, he can 
manipulate group resources to surmount them in advance. This differential 
multisided access to key information clearly adds to his power. 


7. Use of Heterogeneous Groups 


While professional judgments differ as to the advisability of group homo- 
geneity, the balance of opinion seems to favor a reasonable diversity of per- 
sonality type within each group. Although not its major contribution, such 
diversity has the added advantage that it assures a mixed patterning vis-a-vis 
the leader. For the task of norm reversal, this is a substantial asset. A homo- 
geneous group of patients might soon become aware of a shared deviancy 
pattern and, instead of growing under the challenge of the group process, 
might by mutual reinforcement consolidate into a deviant subgroup of the 
type therapy was designed to reverse. 
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8. Affinity of Required Group Norms to the Prevailing Value Structure 


While the norms outlined in Table 1 are not identical with prevailing norms 
they are not value-free as is sometimes implied. They suggest, for example, a 
rather extreme “acting out” of the ideals of honesty, scientific objectivity, 
tolerance, equality, democracy, and mutual aid. They also receive support 
from the religious principle, “honest confession is good for the soul.” While 
these values are not usually directly cited in justification of the new role 
patterns, they do lend the process a certain latent support and legitimation. 


9. Internal Psychological Rewards for Members 


Internal gratifications for participants provide further support. The fol- 
lowing are among the rewards which help sustain a process otherwise fraught 
with anxiety and continuous deflation of the conscious self-image: (a) the 
reduction of inner tension as a result of free emotional expression (‘‘cathar- 
sis”), (b) the sense of wholeness arising from nonrejection by others of traits 
personally rejected, (c) the reassurance of progress that comes when others 
note and commend change, (d) the lessened sense of deprivation and shame 
due to the provision of a reference group composed of people with obvious 
problems, (e) the satisfaction of the need for human intimacy, and (f) the 
countering of the loneliness of the large-city milieu. 

Therapy groups are, however, imperfectly adapted to provide these sec- 
ondary psychological rewards. What they offer is far removed from the mutual 
ego-support and relaxed small talk friends naturally offer each other and is 
distinctly more critical, threatening, and provocative of anxiety. Indicative 
both of the need and the unsuitability of the active therapy group to meet 
these needs are the “kaffee-klatch” sessions that often spring up informally 
after meetings, where members revert to the more usual supportive friendship 
patterns, small talk, and detached pairing. 


10. Provision for Protecting the Group Subculture from Outside Pressures 
and Sanctions 


Further clues to the success of the leader in his radical task lie in the pro- 
tective devices used to shield the group from outside pressures. These will be 
discussed in some detail in the next section. 


Autonomy of Analyst 


Finally, the analyst succeeds, in spite of group pressures, resistances, and 
hostilities, because he is equipped by reason of rigorous training and personal 
analysis to stand firm. In addition, he is fortified with the internal reassurance 
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that he has the support of his professional peers and a professional rationale 
evolved from generations of cumulative experience. Thus equipped to main- 
tain affective neutrality even in the face of a concerted group attack, he can 
often retain sufficient detachment and insight to enable him to resolve a con- 
flict with a constructive interpretation. Ultimately the opposition must yield, 
be won over, or withdraw, thus leaving to continue with the group only those 
who can sustain the role requirements. The members of this intimate nuclear 
group, with growing cohesion and loyalty, gradually acquire the necessary 
skills and attitudes and a self-generating basis for conformity to the new role 
prescriptions. 


CONTEXTUAL RELATIONSHIPS OF THE GROUP UNIT 


The foregoing analysis of the internal processes by which the new norms 
are successfully established made only passing reference to the devices used 
to protect this deviant subculture from outside pressures. 

The need for protective mechanisms arises from the gross divergence of 
these special internal norms from the social codes usual in the middle and 
upper-class cultures from which group members are largely drawn and to 
which, after each session as well as ultimately, they must return. The typical 
external culture decrees, for example, that intense emotions should be largely 
controlled, not expressed; that only limited and acceptable topics may be 
freely discussed in most social groups; that people should minimize or 
altogether avoid unfavorable criticism of others; that people generally should 
be silent about the eccentricities and handicaps of others; that interpersonal 
behavior should conform to that socially defined as appropriate to the rank 
and status of the individuals involved; that antisocial conduct should not be 
openly condoned, etc. How, then, can the therapy unit consistently violate 
these norms without stimulating those outside the group, but closely involved 
in relationships with participants, from intense curiosity and the invocation 
of controls, pressures, and sanctions? Especially, how can neurotics, vulner- 
able as they are to the opinions of others, successfully maintain a continually 
interrupted deviant subculture? In the opinion of the author, the answers lie 
in secrecy of session content and membership, and in limitations on contacts, 
“acting out,” and goals. Each of these devices warrants close attention. 


1. Secrecy of Session Content 


Secrecy of session content is of central importance. It represents, in essence, 
the extension of the physician’s ethic of privileged communication to include 
the entire group proceedings. With the exception of revelations about them- 
selves, members are repeatedly enjoined not to discuss with outsiders the 
group activities and revelations (5). When violations of confidence do occur, 
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they precipitate group discussion and reprimand. Onlookers and visitors are 
seldom admitted, and only with the specific permission of the leader. 


2. Secrecy of Membership 


The second group defense, secrecy of membership, must supplement the 
first for, as Simmel has noted, no secret known by more than two is ever safe 
(11, p. 123). The group is purposefully made up initially of strangers, and 
as far as outsiders are concerned, the membership list is defined as confidential. 
These precautions are often supplemented by anonymity. Fellow members are 
sometimes known to each other only by their first names. Revelation of their 
precise occupational connections is not usually required—a specific exception 
to the norms listed in Table 1. Members are selected, moreover, to avoid over- 
lapping role and status-sets. In other words, both members of a teacher- 
student or employer-employee pair would not be admitted, nor would two 
employees from the same office nor two students from the same class of the 
same teacher. When, and if, exceptions are considered (the admission of hus- 
bands and wives, friends, work associates, etc.), the decision is cleared in 
advance with those participants who would be threatened by the arrangement. 
Thus, those who are privileged to become the confidants of this intimate 
circle are limited in almost all instances to those who have had no previous 
personal connection and will, in all probability, have none in the future. 
Mutual relationships thus follow a patterned fusion of intimacy and social 
distance that Simmel has identified as characteristic of the role of stranger (7). 
The practice reminds one of the spontaneous use of strangers as confidants 
by wives of men committed to mental hospitals (12, pp. 43, 46). 


3. Limitation on Outside Contacts 


To the protection of secrecy of content and membership is added the pre- 
caution of limitation of contacts between participants outside the restricted 
and clearly defined hours of sessions. The typical analyst does not fraternize 
with his patients or meet them outside agreed sessions. He also does not see 
other members of the patient’s family or other role-partners without the pa- 
tient’s knowledge and permission. This practice is extended to group par- 
ticipants and serves the dual functions of (a) protecting the analyst from 
cross-pressures, ambivalent identifications, and extraneous or excessive de- 
mands that might interfere with his other role or status obligations, and (b) 
protecting the patient from embarrassing role conflicts and inadvertent pres- 
tige crises through surprise identifications or breaches of confidence. Whether 
patients ma itain a similar isolation from each other outside the therapy 
setting is left to their discretion. Normally, such contacts are carefully kept 
well within safe limits by informal mutual understanding and accommoda- 
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tion. These limitations on contacts are especially needed by neurotics since, 
unlike the physically ill, they are not commonly excused from their normal 
social roles and are highly vulnerable to exposure by reason of their sensitivity 
to criticism. 

In thus adopting these several techniques of a secret society, group psycho- 
therapy practice has afforded an unexpected confirmation of Simmel’s hypoth- 
esis as to the circumstances under which secret societies arise. In Simmel’s 
own words— 


. . the secret society is the suitable social form for contents which still (as it were) 
are in their infancy, subject to the vulnerability of early developmental stages. A new 
insight, a young religion, morality, or party, is often still weak and needs protection, 
and for this reason conceals itself. 


. . . But the secret society protects the decaying as well as the growing development. 
The flight into secrecy is a ready device for social endeavors and forces that are about 
to be replaced by new ones, In these cases secrecy constitutes a sort of transitional stage 
between being and not-being (11, pp. 346-347). 


Like the secret societies described by Simmel, group psychotherapy incor- 
porates simultaneously both the emerging and the dying. It is specifically 
designed both to bring about the “death” of old inappropriate, but highly 
affect-tinged, ways, and at the same time to encourage experimentation with 
new ways, previously blocked. It is not strange, therefore, that it has adopted 


similar protections. By isolation it assures sufficient delay to allow the new 
deviancy to become firmly rooted before it is forced to face the impact of 
external cross-pressures. Consistent with these functions of secrecy, group 
members commonly sense a declining need for the strict enforcement of 
secrecy norms as they become increasingly secure in their new ways. 

In addition to these obvious protective devices, groups commonly adopt a 
different type of defense, namely concessions to the mores of the contextual 
setting. These include limitation on goals and acting out, and setting limits 
on secrecy norms. Each will be discussed in turn. 


4. Limitation on Goals 


Unlike the secret society, the group therapy unit does not become involved 
in external social action, power politics, religious activity, or any outward- 
directed activity. Thus it maintains maximum energy for intragroup experi- 
mentation and effort, minimizes provocation of out-groups, and avoids retalia- 
tion. Only as the personalities of members are changed or their emotions 
deeply aroused does the group threaten its contextual setting on any serious 
reality basis. 
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5. Limitation on “ Acting Out” 


A further concession to the contextual setting is the restriction on “acting 
out”—in other words, a limitation on the de facto violation of action norms in 
or outside the group. Usually an implicit or explicit distinction is made be- 
tween talking about antisocial wishes and following these wishes, between 
forgiveness for past conduct and permission to repeat the same mistakes, and 
between deviations permitted or encouraged within the group and customary 
behavior outside. While emotional and behavioral experimentation (‘reality 
testing’”’) in outside relationships often does—in fact must—accompany the 
therapeutic process to assure a successful outcome, this experimentation is 
usually guided, when possible, into channels where no permanent, irreversible 
harm will result. While some external tolerance for deviations, irregularities, 
and role failures is assumed, restraints are introduced when major moves are 
in question. Dollard reports, for example, that during critical periods the 
“patient is asked not to change his marital status, not to seek revenge, not 
to change his job or otherwise to unsettle his significant life relations” (2, p. 
248). In spite of these precautions, however, some tragic clashes with con- 
textual settings do occur. 


6. Limitation on Secrecy Norms 


Correlative with limitations on ‘‘acting out,” one finds a limitation on the 
obligation of secrecy placed on the analyst. According to a study by Little (3), 
most psychiatrists draw the line at respecting requests that they remain silent 
about murder, acts of sabotage, and suicidal indications. Exceptions to the 
secrecy norms are, however, extremely limited. In this same study, for 
example, more than half the psychiatrists participating reported that they 
did not feel ethically obligated to report embezzlement (3). Though no study 
has been made, presumably similar limitations would be placed on revelations 
in group sessions. The specific limits beyond which secrecy norms would not 
be honored would, of course, vary with the prevailing mores of the cultural 
setting. 


Incompleteness of Contextual Boundaries 


Together, the six foregoing devices erect marked boundaries between the 
therapeutic group and the outside world. They serve to cloak it, so to speak, 
with a coat of external invisibility. For neurotics, who tend to become affec- 
tively involved in multiple, conflicting relationships, they serve an essential 
function. Without such boundaries, members could readily become immobil- 
ized by cross-pressures and conflicting role requirements and would lose the 
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freedom, courage, and autonomy requisite for the radical tasks required. 
Nevertheless, the boundaries are incomplete in several important respects. 

For example, they protect in only one direction—like a one-way screen. 
They deflect the influences of the outside on the group, but (with the minor 
exceptions noted) not the impact of the group on its environment. What 
members encourage or recommend within the inner sanctum may become the 
dominant determinant of outside actions—with husband or wife, grown son 
or mother, employer or employee, suitor or fiancée. Many an intensely emo- 
tional real-life encounter is precipitated with group encouragement and 
reported back, in turn, to the peer circle. Somehow, however, in the transition 
from talk to action, society succeeds in bringing behavior within permissible 
cultural limits. 

Moreover, the protections restrict direct influences only. Indirect con- 
textual influences are a conspicuous aspect of group sessions, as, for example, 
when members use the group to express aggressive or depressed feelings 
originating in outside relationships. Through its invisibility, a group may 
perform an unrecognized but very real service to other groups by providing a 
safety valve for the release of feelings that otherwise might disrupt outside 
role performance. When conflicts are reexperienced outside their usual context, 
they can be more readily resolved because alibis and defensive projections 
become transparent and no spiral of retaliation is set in motion. These are 
clear, but usually unrecognized, contributions of therapy groups to the larger 
society in which they operate. 


Liabilities of Protective Devices 


While these patterned boundary devices serve essential functions for those 
on both sides, they also exact a price. Role-partners of participants, when 
thus isolated from communication, cannot possibly understand the stresses 
inherent in the role of patient. Consequently, they cannot temper reciprocal 
role expectations or accept temporary deviancy as legitimate. During periods 
of temporary upheaval, they cannot ‘“‘make allowances” or excuse temporary 
role failures or hostilities. Neither can they be alerted to newly developing 
capacities or comprehend demands for altered reciprocity patterns. To this 
extent, the boundaries prevent rather than facilitate the resolution of con- 
flicts with role-partners outside the group. In the larger context, the isolation 
of the therapy group also serves to retard the breakdown of prejudices, whose 
survival serves in turn to withhold from participants the social encouragement 
and support customarily given to the physically ill. 

Still further dysfunctional consequences of these protective devices stem 
from the resulting isolation of the therapist from the role-partners of par- 
ticipants. The pattern closes channels for communications that might bring 
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him warnings of probable unfavorable unanticipated consequences of thera- 
peutic interaction affecting the patient’s relations with his environment. The 
analyst is left solely dependent on the reports of members for information 
on their outside relationships—an obviously biased source. This socially pat- 
terned isolation of the therapist carries with it the risk that he will misinterpret 
the patient’s relationships with role-partners and other external conditions 
and hence may fail to make correct diagnoses or to appraise correctly available 
danger signals. 

Viewed from the point of view of the outside role-partners, the protective 
devices constitute still a further liability, for, by their enforced state of 
ignorance, they prevent those who may be deeply involved in outcomes 
from taking effective action to parry or control the real or imagined threats 
inherent in the radical group proceedings. At the same time reassurance, inter- 
pretation, or warning from the therapist cannot reach them because of the 
same communication barriers. Too often they are left isolated and without 
social support or guidance in times of great or unanticipated stress. Thus the 
protective norms, while solving one set of problems, create others. 

It must also be admitted that in some settings the problems posed for the 
group by the larger context can become insoluble, even with maximum use 
of the protective devices named. A gossipy small-town group or a church or 
friendship group, for example, would hardly provide a safe arena for revela- 
tions of socially taboo deviations or phantasies. One is reminded of the man 
who, after participation in a church-sponsored therapy group, withdrew from 
all church activities because as he said, ‘“‘too many of them know too much 
about me.” (1, p. 12) Group psychotherapy responses must be expected to 
vary with the cultural context and actually do (8). Some settings could make 
the process totally unworkable. In a police state, for example, attitudes of 
hostility and revenge toward civil authorities could hardly be revealed for 
fear that informants had secretly infiltrated. In such a setting, one might even 
witness the complete reversal of ends, the group being used for unscrupulous 
indoctrination or instigation to sabotage instead of for the welfare of the 
patient. As group psychotherapy extends to an ever-widening circle interna- 
tionally, the adaptations to different cultural settings could form the subject 
of an interesting cross-cultural analysis. 


POSTTHERAPY ADJUSTMENTS 


Of the puzzling questions posed for consideration at the outset of this 
article one still remains, namely: How can group psychotherapy adjust people 
to a society that does not follow the normative patterns used in such therapy? 
Judged by the role prescriptions outlined in the first article of this sequence, 
the uninitiated might predict that prolonged exposure to this atmosphere 
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would produce persons too unaccepting of social rituals, too frank, too unin- 
hibited, too indiscrete, too outspoken, too disrespectful of rank and status, 
too unrestrained, too nonconformist. If this should, in fact, be the result, the 
so-called cured would repeatedly provoke social disapproval, rejection, and 
sanctions. Thus rejected, they might end up ineffective, unhappy people, low 
in self-esteem and achievement. Why is this not the typical outcome? How 
are members actually returned to normal role performance? 


Posttherapy Corrective Processes 


The first explanation would seem to be that the members terminate before 
they have completely internalized the norms required for the corrective experi- 
ence. In most cases the original search for help was motivated by suffering, 
which in turn was due to inappropriate, rigid, and handicapping ways of 
relating to others or to themselves. When these maladaptive patterns are 
mitigated, the suffering is reduced and with it the intensity of the motivation 
to continue an otherwise painful therapy. As soon as motivation and interest 
become insufficient to offset the pain and expense, the patients will drop out. 
Usually, this built-in natural corrective leads to termination before members 
become so well socialized to the new deviant, but distinctly part-time, sub- 
culture of the therapeutic group that it in turn renders them ill-adapted for 
subsequent roles. 

A second corrective is provided by the outside social system itself, with 
which they never completely lose contact. Once the members are relatively 
free from the dead hand of the past—from excessive rigidities, self-defeating 
expectations, handicapping defense mechanisms, avoidances, and morbid anxi- 
ety—societal controls operating in their current social milieu can and will 
show themselves to prevent too great swings in the opposite direction. If, by 
error of the group process, members have been deconditioned to essential 
realistic fears or essential ethical practices, their social groups can be relied 
on to apply sanctions sufficient to reteach the essentials. Moreover, these new 
learnings, in contrast to the old, should prove appropriate to current status 
and role requirements. 

The final explanation lies in the increased skills which, if successful, the 
process has made possible. Fortified with increased awareness and insight, 
members in the posttherapy stage should be able to perceive correctly a higher 
proportion of social clues, to adapt realistically to essential changes in social 
roles, and to resolve role conflicts with less stress. Once trained to accept 
uncertainty and to operate from inner perception and conviction without 
external supports, they are better able to test and experiment to learn the 
real flexibility and limits of their current environment. These are probably 
greater than they ever dreamed. To the degree that previous roles are no 
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longer suitable, the more fortunate have gained sufficient initiative and 
autonomy to revolt and restructure more satisfying relationships elsewhere. 


Thus, to a degree, they create a milieu compatible with the core of their new 
learning. 


Unintended Consequences of Therapy 


For many the group experience is either too brief or insufficient. Often 
the process must be supplemented by individual therapy more intensive and 
individualized than that possible in a group setting. For some neither is suc- 
cessful. Patients may terminate or therapists fail in the requisite delicate 
skills involved. For the surgeon of the mind and emotions, as for the surgeon 
of the body, there is always an unintended but calculated risk of failure. 

But aside from an unavoidable quota of failures, can it be that positive 
therapeutic results might actually prove dysfunctional for the individual’s 
life outside? An affirmative answer seems in order. For example, the per- 
sonality changes produced by successful psychotherapy would be dysfunc- 
tional for a patient who, without a change in relative power status, had to 
return to the context in which his neurosis originated, and for which it served 
an essential adaptive function. For this reason, the prognosis is usually rec- 
ognized to be poor if the patient is returned to (or never leaves) his original 
family of orientation—unless other members of this unit also receive treat- 
ment or change under its indirect impact. To retain his gains, the patient must 
find an environment in which his new skills can succeed and be rewarded. 

As an exception, it should perhaps be noted that if a patient’s power status 
within his original group changes by reason of the maturation or aging of 
members, and if an adequate increase in strength has been gained through 
the therapeutic experience, he may be able to maintain his gains in his old 
environment in spite of other members’ expectations to the contrary. Should 
this happen, however, his gains may prove dysfunctional for his probably now 
aging parents. Their lifelong expectations as to reciprocal parent-child obliga- 
tions and behavior might well be shattered. Such a disillusionment could sub- 
ject parents to extreme emotional trauma. 

Similarly, the outcomes of group therapy can be dysfunctional for a mar- 
riage where family patterns are based on a dovetailing of neurotic traits. A 
partner, having learned new ways, may find himself radically out of adjust- 
ment with his spouse’s pattern. If complementary changes cannot be induced, 
divorce, retribution, or continual friction must be anticipated. Should separa- 
tion result, this might prove functional for one and highly dysfunctional for 
the other, as well as for the children of such a union. This is not an uncom- 
mon unanticiapted consequence of treatment of one partner. A spouse, after 
encouraging treatment for a partner to correct a difficult trait, may find in 
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the end that the whole marriage dissolves. The focus of the therapeutic 
process on the welfare of the individual patient, plus its deliberate isolation 
from the environment, may actually promote such a result. 

Other unintended but more temporary and less devastating consequences 
are regularly inherent in the process. These include attrition of performance 
in other roles and the impingement of “transference” attachments on the 
patterning of warmth and affection within the family circle. 

In a similar manner job performance may be adversely affected, as for 
example by a drain on energies of the employee, or loss in job priority or 
loyalty. New learnings may also be directly dysfunctional in an old context, 
as when a submissive employee of an authoritarian boss unlearns his excessive 
obedience. If the employer cannot adjust to a new type of relationship and 
the employee leaves, this outcome may prove highly dysfunctional for the boss, 
particularly if he has invested heavily in his training. If the outcome is not 
also to be dysfunctional for the employee, he will have to find himself an 
occupational niche for which docility is not a requirement. If this is not pos- 
sible and he does not have an independent income, he will have to unlearn his 
newly acquired aggressive skills, thus surrendering hard-won gains in intra- 
psychic harmony. 

In other more subtle and all-pervasive ways therapy may also prove dis- 
rupting to its social milieu. Stanton and Schwartz have pointed out, for 
example, some carry-overs in a mental hospital setting from the therapy 
hours to general institutional functioning—the “cult” of “directness,” ‘“sim- 
plicity,” and “nonapology,” the focus on “seli-improvement” rather than on 
“living,” a hesitation regarding giving advice or reassurance, etc. (10). Sim- 
ilar carry-overs probably characterize noninstitutional therapy but are harder 
to observe because they are dilute in their impact, and those affected are 
isolated from others with similar experiences. To the casual observer, occa- 
sional fiery outbursts are among the conspicuous examples. 

In short, both group and individual therapy are powerful tools. Their con- 
sequences occur in unexpectedly distant spheres, may be both functional and 
dysfunctional, and are unpredictable in detail. Ideally, these methods should 
be applied only by competent therapists and only to patients where the gains, 
both individual and social, may be expected to outweigh the unavoidable 
unfavorable side effects. When the prognosis is favorable for the patient, but 
the anticipated consequences will be dysfunctional for intimate role-partners, 
a conflict of values arises which must give participants real pause and heart- 
searching. At times, such conflicts of interest may be unanticipated by both 
patient and therapist. The therapist may overlook them by reason of his 
vocational focus, his patterned isolation, and his professional value-system 
which places supreme emphasis on the welfare of the individual patient. 
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Patients, intensely identified with their own problems, may fail to anticipate 
probable dysfunctional consequences for role-partners. Fortunately, human 
flexibility sometimes provides under pressure favorable resolutions of conflict- 
ing interests where none could have been anticipated. That improved post- 
therapy adjustment is often achieved and maintained in spite of such con- 
flicts and in an environment that fails, usually by a wide margin, to repeat 
the norms and structural patterns of the therapeutic circle may be called the 
third “social miracle” of group psychotherapy. 


POSSIBILITIES FOR VERIFICATION OF CENTRAL HYPOTHESIS 


Several avenues might be suggested for the testing and verification of the 
proposed hypothesis as to the therapeutic value of experience in a group that 
in selected key respects follows norms opposite to those in which maladaptive 
behavior was learned. The first would involve the study of the changes 
adopted through trial, error, or design when group therapy is tried with 
persons grossly different from the neurotic group on which the present paper 
has been focused. Are the same methods applicable to the treatment of 
delinquents, sex offenders, alcoholics, narcotic addicts, and various types of 
psychotics, or must there be a reversal of the norms here described for treat- 
ment to be successful with such grossly different, almost opposite, types? 
What adaptations are necessary when group therapy is undertaken in a 
residential setting where participants are together around the clock? A study 
of the natural and intentional adaptations evolved in the attempt to treat 
by group methods various different character types in assorted contextual 
settings could afford a fascinating cross-cultural study. The beginning litera- 
ture in this area is encouraging (9). 

A more direct research verification would be exceedingly complex and 
difficult, but would surely be exciting and probably fertile. There would seem 
no need, however, to await formal verification before experimenting with 
variations in therapeutic processes and settings suggested by the proposed 
hypothesis. In such experiments the sociologist would have available a socio- 
logical laboratory par excellence—a real social system dealing with real life 
problems and goals, a group of obvious practical significance, yet small, 
isolated, and temporary enough to be observed and within limits experi- 
mentally manipulated. Its research potentialities have only begun to be 
recognized. 


SUMMARY 


Because of considerations of length, this paper has appeared in two parts. 
The first section, which appeared in the March issue, outlined the norms 
characterizing psychoanalytically oriented group psychotherapy for nonin- 
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stitutionalized adult neurotics. It proposed the central hypothesis that an 
important key to the therapeutic power of group psychotherapy lay in the 
use of norms and the direct fostering of experiences which at the critical 
points where change was indicated were the opposite of those in which the 
neurotic patterns originated. The differences between group and individual 
therapy were outlined and the major assets and liabilities inherent in the 
use of the group milieu for therapy were itemized. 

In the present paper consideration has focused on the initial and terminal 
processes and on contextual relations. Three puzzling questions have been the 
center of interest, namely: (a) How is the analyst able successfully to impose 
on a group new norms that are the opposite of those in which its members 
typically have been socialized? (b) What are the socially patterned protective 
mechanisms by which this deviant subculture maintains itself in spite of its 
sharp divergence in norms from its contextual setting? (c) How can such a 
deviant process ‘‘adjust” participants to a society which does not follow the 
normative patterns used in such therapy? 

The keys to the analyst’s success in achieving the radical norm reversals 
are seen to lie in: (a) selection of members for hizh motivation, (b) maximi- 
zation of hope, (c) patterned functional requ ents for leaders, (d) the 
leader’s “transference” assets, (e) use of gradients in the introduction of 
norm-reversal requirements, (f) the leader’s differential access to channels of 
information, (g) affinity of required norms to some aspects of the prevailing 
value structure, (h) the internal psychological rewards accruing to members, 
(i) protection of the group from outside pressures, and (j) the autonomy of 
the analyst. 

The patterned protective devices used by groups to minimize outside inter- 
ference are seen to include (a) secrecy of session content, (b) secrecy of 
membership, and (c) limitation on outside contacts of members and leader 
with role-set and status-set associates of members outside the group. These 
devices are compared to those used by secret societies. In addition, provocation 
to outside interference is minimized by (a) limitation of goals to intragroup 
objectives, (b) limitation on “acting out,” and (c) placing of limits on secrecy 
norms. The patterned liabilities and assets of these devices are examined, 
together with their effectiveness in preventing interpenetration of boundaries. 

Finally, the two-way posttherapeutic correctional process of adjustment is 
described and hints are given as to the anticipated and unanticipated conse- 
quences, both functional and dysfunctional, of the process. 

The conceptualization proposed is recognized as yet undocumented or tested 
by formal research procedures. A study of adjustments spontaneously or 
planfully made as group therapy methods are extended to other contexts and 
grossly differing patient types is suggested as a fruitful next step. The use of 
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group psychotherapy units as quasi-experimental laboratories for sociologists 
is also proposed. While the emerging problems of group therapy practice, 
theory, and research are admittedly manifold and highly complex, this power- 
ful new therapeutic tool is seen to present to sociologists, as well as to other 
disciplines, an extraordinary opportunity and challenge. 
‘ 
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Some Consequences of Age Heterogeneity 
in Decision-Making Groups ' 


Rosert C. University of Delaware 
RALPH V. Ex ine, University of Delaware 


Status is most frequently defined as relative position in a hierarchy. More- 
over, it is generally agreed that age, sex, and race are status variables (13). 
Finally, it is frequently assumed that a power component is inherent in 
status as a social dimension. The study presented here tests this assumption 
by analyzing the relationship between a member’s age and power or influence 
in a group decision-making situation within the American culture. In addi- 
tion, the study compares the group decision-making processes and products of 
male and female groups homogeneous and heterogeneous with respect to age. 


PROBLEM 


It was postulated that in approaching a dynamic environment involving 
an array of familiar and unfamiliar elements, the tension level of a cognitive 
organism increases as a consequence of negative reinforcement in previously 
encountered unstructured situations. The organism acts to reduce tension by 
attempting to map or give meaning to the array, to anticipate future events, 
and to prepare a propitious course of action. Necessarily, the meaning of the 
array is derived largely from the familiar elements regardless of their relative 
obscurity (21). 

Accordingly, a member of a new group may be expected to seek clues to the 
behavior of the other members and their perceptions of his expected behavior 
or role. These clues in turn serve as guides for his own behavior and help 
minimize interpersonal anxiety stemming from the possibility of negative 
social-emotional reactions of other group members and threats to his own 
self-concept (16, 19). 

In new groups where there is little objective data concerning the members, 
necessarily the orientation procedure is superficial. Since certain status vari- 
ables such as sex, age, and race are manifest, the member’s initial orientation 
in a new group with regard to the group’s power structure is based to some 
extent on these phenotypic status differences within the group, particularly if 


1 Grateful acknowledgment is extended to Dr. John T. Lanzetta for his helpful, critical 
reading of the original manuscript and to Richard Behringer who assisted with the 
statistical analyses. 
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other cogent variables such as sociometric status and intelligence are rela- 
tively indistinct (3). 

Singling out age as the status variable for study and assuming that the 
older member is accorded the higher position in general in the American 
culture, it may be hypothesized that in a group decision-making situation the 
hierarchy with regard to age corresponds to the hierarchy with regard to 
power or influence. 

Several previous studies support this theoretical argument. Thus, Hemphill, 
Siegel, and Westie (11) found that male groups heterogeneous with respect 
to age tend to have older leaders, and that groups in which the behavior of 
the members is highly regulated also tend to have older leaders. Merton (15) 
and Stewart (18) also found some tendency for persons to seek advice from 
those somewhat older than themselves. However, in none of these studies 
was it possible to control for extraneous variables or to compare male and 
female groups. 

A second objective of the study was to explore the generality of homo- 
geneity as a group dimension. Hemphill (10) defines homogeneity as “the 
degree to which members of a group are similar with respect to socially 
relevant characteristics.” 

Two studies (4, 8) reported previously indicate that groups whose members 
are homogeneous with regard to personality characteristics tend to demon- 
strate more satisfactory working relationships. However, the results of experi- 
ments which tested the relationship between homogeneity (of personality 
variables) and group productivity were conflicting (8, 12, 17). In general, 
these studies propose that homogeneity of the members’ personality configura- 
tions reduces interpersonal difficulties (which may otherwise consume the 
limited energy potential of the group) and increases productivity. 

However, with regard to status homogeneity, Heinicke and Bales (9) sug- 
gest that the existence of a stable status hierarchy facilitates decision-making 
because members with unchallenged high status may play determining roles 
in resolving disputes. Moreover, they propose that when the problem of rela- 
tive status is settled, less of the group’s energy potential is dissipated in dis- 
guised status-striving behavior and consequently more energy is available 
for task-oriented behavior. 

With reference to age heterogeneity and new groups in particular, the 
status hierarchy may present a stable group superstructure sufficient to deter 
struggles for personal supremacy (ego-oriented behavior) but sufficiently 
flexible to permit wide latitude with respect to task-oriented behavior. Thus, 
it was hypothesized that groups whose members are heterogeneous (as op- 
posed to homogeneous) with regard to age are more satisfied with the group, 
the group processes, and products and perform at a superior qualitative level. 
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METHOD 
Subjects 


‘The subjects volunteering for the experiment were 72 teachers attending 
the summer session at the University of Delaware. The subjects ranged from 
twenty-one to sixty-three years of age. 


Experimental Procedure 


The subjects were assigned to one of 24 three-person groups according to 
the sex and age requirements of the four experimental conditions (see Table 
4). Twelve male groups were compared with twelve female groups. In addi- 
tion, half of the male and female groups were homogeneous with respect to 
age while the remaining groups were heterogeneous; that is, in the latter 
groups, there was a member in his twenties, another in the thirties, and a 
member who was forty or more. 

In order to control for the age differential between treatments, two of the 
homogeneous groups were composed of members in twenties, two groups with 
members in the thirties, and two groups in which all the members were forty 
or more. Moreover, the groups and members of each group were also matched 
with respect to undergraduate grade point average; no other estimate of 
general intelligence was readily available. 

The four experimental conditions then are simply male and female groups 
homogeneous and heterogeneous with respect to age. 

On entering the experimental room the subjects were introduced to each 
other and informed that they would be observed by an assistant through a 
one-way mirror. The subjects were told that through these observations the 
experimenters hoped to develop principles which would help groups to improve 
their decision-making ability. The members were informed that they would 
be asked to resolve two decision-making situations. Then, in an effort to 
control for years of teaching experience, it was announced to the subjects that 
“experience in teaching is not related to decision-making abilities in either of 
the two tasks.” 

During the initial phase of the first task, the members worked as individuals. 
After submitting written individual answers or decisions to the experimenter, 
the group was allowed a maximum of 20 minutes in which to discuss the 
problem and submit a group decision. Finally, the members privately sub- 
mitted individual estimates or decisions. 

On the second task, the members were permitted 30. minutes to submit a 
group answer. Following this, the group members completed questionnaires 
pertaining to their satisfaction with the decision-making processes and prod- 
ucts as well as a description of the roles performed by group members. 
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Finally, the answers to the problems were announced, and the experiment 
was discussed briefly with the group. 


Tasks 


In the first decision-making task, the group or individual was required to 
estimate the number of dots on a 16 x 21-inch card with 3,159 black dots 
scattered rather uniformly, yet in no geometric pattern, over a white back- 
ground. The card was exposed for only 5 seconds. 

The second task was an “education free” judgment problem in the unpub- 
lished “Intellectual Talents Battery” developed by Irving Lorge for the 
United States Air Force. In this exercise a problem was raised and 15 facts 
presented which had a bearing on the solution to the problem. The subjects 
were instructed to choose 6 of the 15 facts which were most important in 
reaching a decesion as to whether a state education board should develop 
extensively the use of television as a means of instruction, or whether to 
concentrate on motion pictures. The “correct” responses were determined by 
an internal consistency technique involving Air Force Officers as subjects. 


Measures of Group Satisfaction 


Following the movie-TV task the subjects responded to four Lickert-type 


questions concerned generally with member satisfaction with the group. The 
following items comprised the questionnaire: 
1. Consider the entire problem-solving session: My opinion was given the utmost 
consideration. 
. All things considered: I was extremely satisfied with the quality or degree of 
excellence of the decisions reached by my group. 


. The participants worked as a highly unified group rather than as a disjointed col- 
lection of individuals. 


. If new problem-solving groups were now formed, I would be extremely satisfied 
with working with the same group. 


Upon analyzing the responses to the items, it was found that the correla- 
tions between the weighted responses on each individual item with the total 
score (the sum of the weighted responses) ranged from .70 to .80, uncorrected. 
On this basis only the total scores were included in the analysis. 

A second measure of group satisfaction was derived from the private esti- 
mates of the members following the group decision. It was assumed that 
acceptance of the group decision reflects satisfaction with the group and 
group processes. Thus, the average deviation of the members’ private esti- 
mates from the group estimate was calculated. However, the correlation 
between the two measures of group satisfaction was zero. 
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Measurement of Power and Functional Roles 


In a recent study (14) it was found, in general, that common measures of 
influence do not yield identical results consistently and that in some cases the 
intercorrelations between rankings of individuals generated by the different 
measures tend to be quite low. However, two measures were found to be 
moderately highly correlated with most of the other measures of influence 
included in the study (particularly in new groups): an index of the relative 
frequency with which the members communicated during the group discussion 
and post interaction attributed influence of the group. The rank correlation 
coefficient, tau, between these measures was 0.58, n=28 (14). In order to 
obtain the first measure, the entire group discussion was recorded in accord- 
ance with the Bales method of interaction analysis (1). The unit of participa- 
tion involved in the measures of power was that described by Bales. The 
records of interaction were also analyzed in order to explore the functional 
roles of the members in various positions in the age status hierarchy. 

The second measure of power or influence was derived from four ratings 
on a 9-point scale by the members of the other members’ influence on the 
group, the quality of their ideas, the extent to which they assumed the role 
of organizer, and pleasantness. The total of the two ratings of each member 
was used as a score for each item. However, since the product moment cor- 
relations of the scores on the individual items and the total scores ranged 
from .65 to .77 (uncorrected), the total scores alone were retained in the 
analysis. 


RESULTS 
Power 


The analyses of the relations between the relative chronological age of the 
group member and influence are presented in Tables 1 and 2. In Table 1 the 
percentage of the group’s total interaction initiated by the members in the 
various age categories is analyzed with reference to the male and female 
groups and tasks one and two in a 2 x 2 x 3 factorial design. The results 
reveal a significant interaction between the sex of the groups and the age of 
the group member with regard to participation. Considering both tasks 
jointly, in the male groups age and amount of participation are directly 
related; the oldest male members contributed most frequently, while mem- 
bers in the twenties contributed least frequently. However, with regard to 
the female groups the youngest female tended to contribute most frequently, 
while the members in the thirties contributed least frequently. 

The second measure of influence investigated was the total of each mem- 
ber’s ratings by the remaining members with regard to the member’s behavior 
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TABLE i 


Analysis of Variance of the Group Members’ Percentage of Participation 
According to the Member’s Status, Sex of the Group, and the Group Task 


Task 1 (Dot) Task 2 (TV) Joint 


Male Female Male Female Male Female 
Status groups groups groups groups groups groups 


High 37.8 36.0 34.3 SS. 72.6 71.3 
Medium 36.3 31.2 31.3 70.1 62.5 
Low 31.4 38.1 36. 38. 68.1 76.3 


Note: Mean participation (n — 6) after each individual’s percentage had been adjusted 
by means of the arc sine transformation. 


Source of variation Mean square 


Task 

Sex 

Status 

Task x sex 

Task x status — 

Sex x status ‘ 3.87 (P= .05) 
Task x sex x status 

Within 


Total 


during the decision-making session. The results were not statistically signifi- 
cant (see Table 2). 

The results with regard to the Bales interaction profiles were not statisti- 
cally significant nor were any substantial trends apparent. (These tables were 
excluded in order to conserve space.) 


Homogeneous and Heterogeneous Age Groups 


The second objective of the study was to compare the behavior of male 
and female groups homogeneous and heterogeneous with respect to the age of 
the members. The relative accuracy with which these groups performed on 
each of the tasks is shown in Tables 3 and 4. With reference to the first task, 
the dot-estimate problem, it became necessary to apply the cube root trans- 
formation to the group error scores (the absolute difference between the 
group’s estimate and the actual number of dots) since, theoretically, the 
upper limit is infinity. The subsequent analyses of variance was found to be 
statistically significant, suggesting that homogeneity with reference to male 
groups is associated with inferior performance in comparision to heterogeneous 
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TABLE 2 


Analysis of Variance of the Power Ratings of the Members According to 
the Member's Status and the Sex of the Group 


Sex of Group 


Status Total 


High 
Medium 
Low 


114.2 
115.5 
119.0 


Norte: The figures represent the mean ratings of the members (n =6 in each cell). 


Source of variation 


Sex 

Status 

Sex x status 
Groups within 
Subjects within 


Mean square f F 


groups. The converse is true with regard to female groups. On the second 
task, only the findings with regard to the males were supported. (See Table 4.) 


However, the results of the over-all analysis of variance were not statistically 
significant. 


TABLE 3 


Analysis of Variance of the Dot-task Performance of Homogeneous and 
Heterogeneous Male and Female Groups 


Sex of Group 


Age distribution 


Female Total 


Homogeneous 58. 40.9 99.6 


Heterogeneous 61.4 95.1 


Note: Mean error score of groups in this cell (n= 6). 


Source of variation Mean square 


Sex 
Age distribution 
Interaction 

Within 


10.67 (P= .05) 


Totals 19 


t-test between homogeneous and heterogeneous male group = 3.18; p = .02. 
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TABLE 4 


Analysis of Variance of the Performance of Homogeneous and Heterogeneous 
Male and Female Groups on the Movie-TV Task 


Sex of Group 


Age distribution Male Female 


Homogeneous 3.2 4.0 
Heterogeneous 4.8 4.0 


Nore: Mean score of groups in this cell (n= 6). 


F 
Source of variation Mean square (required) 


Sex 
Age distribution 
Interaction 


t-test between homogeneous and heterogeneous male 


The relative satisfaction expressed on the questionnaire by the members 


under the four experimental conditions are presented in Table 5. The results 
of the 2 x 2 analysis of variance were not significant. Still it is interesting to 
note that the least satisfaction was expressed by the male homogeneous groups. 

A second test of the relative group satisfaction involved the private esti- 
mates of the group members following the group decision. A nonparametric 
2 x 2 analysis of variance design'was employed (20) involving age homo- 
geneity-heterogeneity and sex of the group. Interaction was statistically sig- 
nificant at the .02 level of confidence. Specifically, greater agreement with the 
group decision was found in the homogeneous male and heterogeneous female 
groups as opposed to the heterogeneous male and homogeneous female groups. 

March (14) suggests that as the structure of a new group stabilizes, the 
most influential members tend to reduce their participation relative to the 
other group members. Conversely, new groups in which the most influential 
members tend to reduce their participation relative to the other members 
may be assumed to have stabilized their structure. Thus, a test of the 
structural stability of the group is indicated. 

Toward this end an analysis of variance for groups within treatments and 
for repeated measures was calculated (6). First, the three members of each 
group were ranked according to their ratings of influence. Then the relative 
participation (transformed) of these members was categorized according to 


| a 
| 
205 
| 
Total 
8.8 
— 
| 
4.16 1 
eo 4.16 1 3.25 4.35 
Within 1.28 20 
Total 23 ~ 
group = 2.32; p= 05. 
re 
; 
| 
{ 
wary 


SOCIOMETRY 


TABLE 5 


Analysis of Variance of the Relative Satisfaction Expressed by Male 
and Female Groups Homogeneous or Heterogeneous with Regard to Age 


Sex of Group 


Age distribution Male Female Total 


Homogeneous 89.0 93.5 182.5 
Heterogeneous 94.0 95.0 189.0 


Norte: Mean group’s expression of satisfaction based on mean group member’s responses 
to questionnaire items 1, 2, 3, and 4. 


Source of variation Mean square 


Sex 

Age distribution 
Interaction 
Within 


Totals 


t-test between homogeneous and heterogeneous male groups = .92. 


the sex of the group, the age distribution of the group, and the group task 
(2 x 2 x 2 x 3 factorial design). The analysis is presented in Table 6. 

With regard to group structural stability, indicated by a decrease in the 
relative participation of influential members (interaction between “Task” 
and “Influence” in Table 6), the results were not significant. However, it was 
noted that the homogeneous groups, the male group, and the more influential 
members participated more frequently (p=.01, .05, and .0S respectively). 
Moreover, the homogeneous male groups were found to participate more fre- 
quently (p=.01) and the homogeneous groups tended to increase their par- 
ticipation more from task one to task two (p=.01). Finally, the high influence 
members of male homogeneous-age groups tended to participate more fre- 
quently relative to the other group members than did the high influence 
member in any of the other treatments (p=.01). However, it is necessary to 
point out that the aforementioned findings with regard to Table 6 hazard 
type one statistical errors since “subjects within” was used as the error 
variance rather than “groups within.” 

In yet another analysis, a comparison of the four experimental conditions 
was made with reference to the Bales profile of group interaction. Again the 
results were not statistically significant and no substantial trends were 
observed. (Once again, the tables are excluded for brevity.) 
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TABLE 6 


Analysis of Variance of the Total Units of Participation of Members Categorized 
According to Their Ratings of Influence, Sex of the Group, Age Distribution 
of the Group, and Group Task 


Source of variation Mean square 


& 


1. Sex 8051. 
2. Age distribution 12709. 
3. Task 41403. 
4. Influence 5438. 

11002. 

1x3 18. 

1x4 3579. 

2x3 2950. 

2x4 649. 

3x4 

lx3x4 

2x3x4 

1x2x3x4 

Groups within 

Groups within x task 

Subjects within 

Subjects within x task 


Total 


Nore: “(a)” indicates that Subjects within was used as the error term and “(b)” that 
Subjects within x task was used. Only 20 groups were included in this analysis because 
there were two instances in which two members of the same group received the equal 
influence ratings. Consequently these groups were eliminated and two other groups were 
eliminated randomly from the remaining treatments in order to equate the sample sizes. 


DISCUSSION 


In general the results indicate that the relative chronological ages of the 
group members is associated with some aspects of group and individual mem- 
ber behavior with reference to new groups, thus confirming to a degree the 
theory of categorization outlined at the outset. However, it must be acknowl- 
edged that age is a highly complex phenotypic variable, whose psychological 
and sociological components remain relatively unknown. 

Some indication of the complexity of the variable may be inferred from the 
contrasting and unpredicted results concerning influence obtained with refer- 
ence to men as compared with women (see Table 1). 

Thus, in the male groups, the oldest member assumed the key power 
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position (in corroboration of earlier studies involving male groups (11)); 
while in the female groups the youngest member held this position. With due 
regard for the subculture, public-school teachers, in which this study was 
conducted, these results lend a modicum of support for a rather general 
observation that in the American culture, the relative influence of the male 
increases with age, while on the other hand, the relative power or influence 
of the female decreases with age (7). However, the results with regard to 
females in the present study may simply reflect recent changes in child- 
rearing practices or social acceptance of an aggressive female role in recent 
years. 

On the basis of the foregoing discussion, let it be assumed that the mem- 
ber’s age and influence are two relatively independent status variables. The 
condition in which the member’s positions on each of the status variables are 
in phase or coincident is referred to as “status congruence” (2). The condition 
in which the member’s positions on each of the status variables are not in 
phase or coincident is referred to as “status incongruence.” In an earlier 
study (5), it was hypothesized and confirmed that status incongruence is a 
predisposing condition for role conflict, and, subsequently, decreased pro- 
ductivity in a decision-making situation. Applying the congruency concept to 
the results discussed above, the heterogeneous male groups may be described 
as congruent with regard to the status variables of age and influence (the 
oldest male also tends to be the most influential). However, the heterogeneous 
female groups tend to be incongruent with regard to the status variables of 
age and influence (the oldest female was not the most influential). In the 
present study, the results of the earlier study are confirmed, albeit a pos- 
teriori. The male heterogeneous-age groups (which are congruent with regard 
to age and influence) perform at a superior qualitative level when compared 
with female heterogeneous-age groups (which are incongruent with regard to 
age and influence). 

The second objective of this study was to test the generality of the homo- 
geneity hypothesis by comparing the results of the present study involving 
homogeneous-heterogeneous age groups with those reported by other investiga- 
tors involving personality variables (4, 8, 12, 17). 

However, since status incongruence evolved as an intervening variable, the 
assumption stated at the outset concerning greater role stability in the hetero- 
geneous age groups is acceptable only with regard to the male groups. There- 
fore, focusing on the male groups only, the second hypothesis is supported at 
least with regard to the productivity criterion (Tables 3 and 4). That is, the 
heterogeneous male groups in comparison with the homogeneous male groups 
reached decisions of higher quality in both decision-making situations. 


There is additional and substantial evidence supporting the assumption 
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concerning greater role stability in male heterogeneous-age groups. Thus, 
heterogeneous-age male groups were somewhat more satisfied with the group 
and the group’s problem-solving processes (see Table 5). On the basis of the 
results reported in Table 6, it may be inferred, in general, that there is more 
status-striving behavior in the homogeneous male groups (particularly on the 
part of the high-influence member) than under any of the other treatments. 
Finally, there was a tendency for the heterogeneous male group members 
privately to conform less to the group decision. Since there was no relation- 
ship between the two instruments designed to measure group satisfaction, 
the member’s private acceptance of the group estimate of the number of dots 
(a task with a minimum of objective data and thus encouraging final judg- 
ments based on social reality) was interpreted here as a measure of group 
pressures toward uniformity rather than group satisfaction. Thus, there is 
evidence that a status hierarchy presents a stable superstructure sufficient to 
suppress struggles for personal supremacy, at least with regard to male groups 
and with reference to age as a status variable. 

Finally, Hemphill (10), as cited at the outset, proposed that homogeneity 
is a fundamental group dimension. The findings of the present study tend to 
circumscribe, to a degree, the generality of this concept. The social variable 
with regard to which the group is homogeneous is a necessary qualification. 

However, it is also necessary to qualify the results of the present study 
in view of the subculture represented by the subjects, public-school teachers. 
Moreover, since the design involved a phenotypic status variable and it was 
quite impossible to control for all other status symbols, the findings may have 
been contaminated. These considerations underscore the restrictive nature 
of the findings, the generality of which can only be determined by further 
research involving subjects from various cultures and subcultures. 

Turning from the results to the more general topic of experimental design, 
the phenotypic variable, age, presents some rather unique characteristics rela- 
tive to small group research which at least warrant brief comment. Age is an 
immediately apparent, physiologically defined status variable, a peculiar 
characteristic held in common with sex and race in particular. One of the 
experimentally useful consequences of the potency of these variables is that 
the disadvantages of experimental inductions are avoided. Moreover, the 
stimulus effect of the variable is relatively continuous, thus creating some 
of the characteristics of a “stable” group rather than the typical new labora- 
tory group. Finally, these variables are all characterized by a concrete observ- 
able referent and, therefore, require no inferences which introduce severe 
problems of validity and reliability. At the group level, group size and sex 
heterogeneity are variables which may be similarly characterized. 

On the other hand, of course, there are some apparent difficulties introduced 
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by using a phenotypic variable. Conceptually it lacks clarity; that is, its 
psychological and sociological components remain undefined and indistinct. 
Then, too, an observer bias is inherent. Nevertheless, the unique character- 
istics of this variable and other phenotypic status variables (particularly sex 
and race which have the additional characteristics of being dichotomous) 
present some rather provoking possibilities for research involving group 
composition. 


SUMMARY 


This study was designed to explore some of the power correlates of a 
phenotypic status variable, age, in a decision-making situation; and some of 
the consequences of age heterogeneity with regard to adult male and female 
decision-making groups in the American culture. 

The results did not support the hypothesis that in a group decision-making 
situation the hierarchy with regard to age corresponds to the hierarchy with 
regard to power or influence. However, interaction effects between age dis- 
tribution and the sex of the group were significant. Thus, the oldest member 
demonstrated the greatest power in the male groups, but in the female group 
the youngest member held the key power position. 

The results with regard to age heterogeneity seemed to indicate that there 
is less role clarity (or more status incongruence) in the male homogeneous-age 
groups and female heterogeneous-age groups than in their counterparts and 


is associated with greater role conflict and lower group productivity. 
Some considerations relative to experimental designs which involve pheno- 
typic variables were also briefly discussed. 
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Personality, Social Choice, and Adults’ Perception of These 
Factors in Groups of Disturbed and Normal Children * 


ANTHONY Davips AND ANITA NEGRIN PARENTI 
Brown University and Emma Pendleton Bradley Hospital 


Many researchers have focused on the social structure of school groups, 
and have related the social status of individuals within the groups to measures 
of personality and adjustment. These studies (1, 3, 4, 11, 13, 14, 15, 16, 19) 
have found a general and quite consistent relation between possession of 
socially desirable personality traits and high social status among one’s group 
mates. Also, some studies have found that teachers had statistically significant 
success in judging the relative sociometric status of their pupils, as indicated 
by agreement with their standing based on classmates’ choices (1, 2,9, 10). In 
the present investigation we will attempt to extend these findings by studying 
similar relations in the social settings of a children’s residential treatment 
center and a normal summer camp. 

More specifically, this research has the following threefold purpose: (a) to 
investigate relations between social choice and measures of personality and 
emotional adjustment in groups of normal children and disturbed children; 
(b) to compare the normal and disturbed groups with respect to group 
structure as indicated by the degree of mutual choice in the sociometric 
patterns within the groups; and (c) to discover how accurately adults working 
with these groups of children could perceive their interpersonal relations. 


METHOD 


Subjects 


Three groups of Ss were studied. One group consisted of 48 emotionally 
disturbed children (38 boys and 10 girls) who were undergoing residential 
treatment at the Bradley Hospital in Riverside, Rhode Island. These children 


1We wish to express our appreciation to the director and child care staff of the 
Bradley Hospital for their cooperation in our study of the hospitalized children. We are 
also indebted to Mr. Edward E. Martin, superintendent, and Mrs. Louise Favorite, psy- 
chologist, for their assistance in our study of children in the East Providence public 
schools. Appreciation is also expressed to Mr. Murray Ortof, director, for making it pos- 
sible for us to study the children at Camp Hurley in Kingston, New York. This research 
was facilitated by a summer undergraduate research stipend awarded to Mrs. Parenti by 
the Social Science Research Council, and was supported in part by research grants M-1722 


and M-2087 to Dr. Davids from the National Institute of Mental Health, Public Health 
Service. 
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were subdivided into units of 7 to 15 children, with two adult child care 
workers assigned to each unit. The children in each unit ate and slept together 
and spent evening hours in their own lounge area. However, during many of 
their waking hours they engaged in an activities program that included ali 
children in the institution. The residential staff included a psychiatrist and a 
clinical psychologist who were familiar with all the children. 

A second group consisted of 80 normal children (47 boys and 33 girls) who 
were attending a summer camp in Kingston, New York. These children were 
subdivided into bunks of 7 to 12 children who were separated from each other 
daily for many activities, but who participated together in others. Each bunk 
had two counselors working with the group at all times. The counselors were 
supervised by a professional social worker who did not work directly with the 
children but who was well acquainted with all of them. 

A third group consisted of 57 normal children (23 boys and 34 girls) who 
were attending public school in East Providence, Rhode Island. This group 
was subdivided into a fourth-grade class of 12 boys and 18 girls, and a fifth- 
grade class of 11 boys and 16 girls. 

The mean age in the Bradley group was 10 years and 3 months; in the 
camp group was 10 years and 8 months; and in the school group was 10 
years and 1 month. Although the difference between the two normal groups 
is statistically significant, neither normal group differed significantly from 
the mean age in the disturbed group. 


Procedure 


Friendship choices. The children in each group were asked to name their 
three best friends on four occasions with a one-week period between each 
interview. The Ss in the emotionally disturbed group, on each occasion, were 
also asked to name the three children at the Bradley Hospital that they dis- 
liked the most. The instructions were as follows: “Please tell me the names of 
your three best friends or the children you like best at Bradley (school or 
camp). Tell me the one you like best, then the second best, and then the 
third best.” At each interview subsequent to the first one, the following 
instructions were added: “The people you name may or may not be the ones 
you liked best the last time. Be sure to tell me the three you like best now.” 

Scoring the friendship data. From these data a popularity score could be 
derived for each child. The measure of popularity was based on a weighted 
score, with 3 points awarded for each time a child was mentioned in the first 
position, 2 points for each time he was mentioned in the second position, and 
1 point in the third position. Since it was felt that the position in which a 
child was chosen by another child was an important indicator of his popu- 
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larity in the group, weighted scores rather than raw scores were used.” A 
similar procedure was followed in deriving a rejection score (based on the 
children’s statements of the Ss they disliked) for each child in the Bradley 
group. In addition, a noticeability score, based on the total number of choice 
points plus the total number of rejection points, was computed for each child 
in the Bradley sample. From these scores, three subgroups were formed 
within the Bradley sample, and two subgroups were formed within the camp 
sample. The Bradley groups were as follows: (a) the “most popular” 
group, consisting of the ten children who received the highest popularity 
scores; (b) the “most disliked” group, consisting of the ten children who 
received the highest rejection scores; and (c) the “ignored” group, consisting 
of the 13 children who received the lowest noticeability scores. Within the 
camp sample the following groups were derived: (a) The “most popular” 
group, consisting of the ten children who received the highest popularity 
scores; and (b) the “least popular” group, consisting of the 13 children who 
received the lowest popularity scores. 

In addition to the above scores, the data were scored for mutuality of social 
choice. There were six possible relations between reciprocal choices: (a) 
mutual first choice; (b) mutual second choice; (c) mutual third choice; (d) 
first choice-second choice; (e) first choice-third choice; and (f) second choice- 
third choice. The number of mutual choices that each child was invoived in 
was computed. 

Personality evaluations and adults’ perceptions of social rélations. The 
counselors and the social worker rated the camp children on the following 
eight personality traits: sociocentricity, optimism, trust, egocentricity, pes- 
simism, distrust, anxiety, and resentment. Previous research with adult Ss 
has shown these traits to be important determinants of the adequacy of the 
individual’s personal and social adjustment (5, 6, 7, 8). In the present study. 
comparing each child with other children of his age and sex, the adult rated 
him on each of these traits on a 6-point scale, running from “much less than 
most” to “much more than most.” In addition to the separate ratings on these 
traits, each child was rated on “general emotional maladjustment.” The 
reliability of these ratings of the camp children, as indicated by the correla- 
tion between the two raters, was .54, which is statistically significant. The 
Bradley children were rated on these same variables by a psychiatrist and a 
clinical psychologist, whose ratings showed a statistically significant correla- 


2 After employing this weighted score in all the statistical analyses, a correlation was 
computed between the weighted popularity score and the raw score based on the number 
of times an S was chosen, without consideration of the position in which he was chosen. 
The resulting coefficient was .98, indicating that the two scoring systems gave almost 
identical results. 
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tion of .77. Also, the adults working with each camp bunk or Bradley unit 
were asked to indicate whom they believed to be the three best friends of each 
child under their care. The personality ratings and adults’ perceptions of 
social relations were not obtained for the school group. 

Analysis of the data. The personality ratings, made by the social worker 
for the camp group and the psychologist for the Bradley group, were cor- 
related with the Ss’ popularity scores and with the mutuality of choice scores. 
The three groups were compared with respect to the proportion of mutual 
choices within ‘each group. Comparing the adults’ perceptions with the chil- 
drens’ actual choices within the Bradley and camp groups, the following per- 
centages were computed: (a) the number of times (out of the possible number 
of times) that the adult mamed the child’s best (second best or third best) 
friend in the same position that the child named that person; (b) the number 
of times (out of the possible number of times) that the adult named the 
child’s best (second or third best) friend in any of the three positions. 
Finally, the frequency with which the adults named a child as being the 
first, second, or third best friend of another child was computed, and each 
child was assigned a score which indicated the adults’ perception of his popu- 
larity within the group. For the Bradley and camp samples, correlations were 
computed between the actual popularity of each child (as indicated by the 
number of times he was chosen during the last interview) and the child’s 
popularity as derived from the adults’ perceptions. 


Hypotheses 


It was hypothesized that: (a) within both the camp and Bradley samples 
measures of popularity and of mutuality of social choice would be positively 
associated with possession of desirable personality traits and with degree of 
emotional adjustment; (b) in the Bradley sample, the “disliked” group would 
show poorer personal adjustment than would the “ignored” group; and (c) 
the degree of mutuality of social choice within the Bradley sample would be 
less than in the two normal samples. Although specific hypotheses were not 
formulated in regard to the accuracy with which the adults who worked with 
the children could perceive the social structure of the groups, it was believed 
that they would be able to anticipate the children’s social choices at better 
than a chance level. This aspect of the present experiment was largely 
exploratory in nature. 


RESULTS 


The correlations between the personality ratings and popularity scores are 
shown in Table 1. It was found, as predicted, that in both the Bradley group 
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TABLE 1 


Product-Moment Correlations between Sociometric Measures and Personality Ratings 


Bradley (N = 48) Camp (N = 80) 


Personality attribute Popularity Mutuality Popularity Mutuality 


General emotional 

maladjustment 317 .42 7 
Egocentricity 45 .19 * 357 
Pessimism 42 .24* 
Distrust ‘ .28 
Anxiety 437 .36 
Resentment 447 a .287 
Sociocentricity ae” 
Optimism 354 
Trust 


* Significant at the .05 level with a 1-tailed test. 
+ Significant at the .01 level with a 1-tailed test. 


and the camp group the most popular Ss were judged to be better adjusted 
emotionally. Also, it is evident that within both groups, social popularity 
tends to be associated with possession of positive personality attributes and 
absence of negative traits. 

In addition to these correlations, in the disturbed group the mean person- 
ality ratings of the most popular subgroup of Ss were compared with the 
mean ratings of the rest of the Ss in the group. As shown in Table 2, the 
popular Ss received personality ratings indicative of better general emotional 
adjustment, higher standing on possession of positive traits, and lower stand- 
ing on the negative or undesirable traits. The “most disliked” subgroup in 
the Bradley sample, as shown in Table 2, received ratings indicating that they 
are less well adjusted than the rest of the Bradley group on the eight per- 
sonality traits and on the general rating of emotional disturbance. The only 
findings of significant difference between the “ignored” subgroup and the 
remainder of the Bradley sample was on the trait of resentment where it was 
found that these ignored Ss tended to be more resentful. Further analyses 
were concerned with comparisons between the subgroups. As shown in Table 
3, the most popular group differed significantly from the ignored group on 
all traits except trust, distrust, and resentment. Moreover, the ignored group 
differed significantly from the disliked group on general emotional disturbance, 
resentment, trust, and optimism. In each instance, the ignored group obtained 
personality ratings indicating a more adequate pattern of adjustment than 
the disliked group. 
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TABLE 4 
Mutuality of Friendship Choices in the Three Groups 


Reciprocations 


Group Total N 


Bradley 48 
Camp 80 
School 57 


These results, which are in accord with the hypotheses, show that there is 
a significant degree of association between social status (popularity, rejection, 
and noticeability) and emotional adjustment, such that popular Ss tend to be 
better adjusted than Ss who are either ignored or disliked, and the disliked 
Ss tend to be more emotionally disturbed than are the ignored Ss. 

In analyzing the mutuality of choice in the three groups, each group was 
divided (using the data from the last interview) into those Ss who reciprocated 
one, two, three, or no friendship choices, regardless of the positions in which 
the children named each other. The groups were compared in regard to the 
per cent in each group falling into each of these categories of mutual choice. 
As shown in Tables 4 and 5, a significantly higher percentage of the Bradley 
children reciprocated none of the three friendships, and a significantly higher 
percentage of the normal children reciprocated all three friendships. It is note- 
worthy that the two normal groups did not differ on mutuality of social choice. 

Within the Bradley sample and the camp sample, the mutuality of choice 
scores were correlated with the personality ratings. As shown in Table 1, 
the correlations for both groups indicate that reciprocation of friendships 
is significantly associated with good emotional adjustment and possession of 
desirable personality traits. 


TABLE 5 


Statistical Comparison of the Percentage of Mutual Choices in the Three Groups 


Reciprocations 


Groups compared 


Bradley vs. camp 
Bradley vs. school 
Camp vs. school 


* Significant at the .05 level with a 1-tailed test. 
+ Significant at the .01 level with a 1-tailed test. 
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It should be pointed out that, in a way, the measures of popularity and 
mutuality are not independent, as it would be impossible to obtain a high 
mutuality score if the person were never chosen by others. However, it is 
possible for a person to be popular (chosen frequently by others) but to have 
a low mutuality score (choose people who do not choose him). Thus, these 
two measures are far from identical. It is noteworthy that in the camp sample, 
in every one of the eight instances, the mutuality measure correlates with 
the personality measure to a higher degree than does the popularity measure. 
Most striking is the finding that whereas the magnitude of the correlation 
between popularity and resentment is —.28 the comparable correlation 
between mutuality and resentment is —.71. The latter correlation accounts 
for a 42 per cent greater proportion of the variance. Findings such as these 
make it quite obvious that there is much to be gained in utilizing both of 
these measures in studies relating social choice and personality. 

Turning to the agreeemnt between the adults’ perceptions of the friendship 
patterns and the actual friendships indicated by the children’s choices, it was 
found that the Bradley unit leaders and the disturbed children agreed on 24 
per cent of the choices, and the camp counselors and normal children agreed 
on 30 per cent of the choices. These percentages indicate the proportion of 
times out of a maximum possible number of times that the adults could 
name any of the child’s three best friends without consideration of their exact 
position in his friendship pattern. As would be expected, the adults were best 
able to perceive the children’s best friends as indicated by the fact that they 
included the child’s actual best friend as orie of the three choices listed for 
the child in 34 per cent of the selections made by the Bradley unit leaders and 
54 per cent of the selections made by the camp counselors. When the exact 
position of the friends in the choice pattern were taken into consideration, it 
was found in the Bradley sample that the adults could correctly indicate 22 
per cent of the children’s best friends, 11 per cent of their second best friends, 
and 2 per cent of their third best friends. For the camp sample, the figures 
were 44, 11, and 3 per cent, respectively, for perceptions of first, second, and 
third best friends. 

In addition to studying the adults’ perceptions of friendship patterns, it 
was possible to compare the adults’ perceptions of each child’s popularity 
with his actual popularity. That is, the child’s “assigned popularity score,” 
based on the number of times he was mentioned by the adults as being the 
friend of another child, was correlated with the child’s “real popularity score,” 
based on the actual number of times he was chosen by other children. It was 
found that in the Bradley group the correlation was .63, and in the camp 
group the correlation was .52. Both of these coefficients are significant and 
indicate that in both the normal and the disturbed groups, the adults who 
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worked with the children were able to accurately perceive the social popu- 
larity of the children under their care. 


DISCUSSION 


The present findings provide further evidence that popular children tend 
to be well adjusted emotionally and to possess socially desirable personality 
traits. The fact that even within the disturbed group the relatively better 
adjusted Ss tended to be the most popular serves to demonstrate the gen- 
erality of these relations. 

Comparison of the mutuality of friendship choices shows that the social 
structure of the disturbed group differs from the normal groups. Clinically, 
a major symptom or sign of inner emotional disturbance is unhappy social 
relations, and the present experimentation has revealed significantly less 
mutuality of social choice in the disturbed group. Moreover, even in the 
disturbed group, the Ss who named as friends children who also named them 
were better adjusted emotionally and possessed the more desirable person- 
ality structure. It is evident that when intrapersonal adjustment is disturbed, 
interpersonal relations are also askew, and mutually rewarding social relations 
are characteristic of emotionally healthy individuals. 

In considering the personality attributes that are most highly related to 
the sociometric measures, it appears that the trait of resentment or aggression 
is very critical. The more resentful children tend to be disliked and are unable 
to form mutually satisfying friendships and unable to achieve high social 
status. Since they are rejected more often than other children, they must 
experience considerable frustration in their social relations. It seems likely 
that this unhappy situation is cyclical in nature. That is, the more resentful 
and aggressive the child is, the more he is rejected by others, and the lack of 
social rewards makes him all the more resentful. In discussing the relations 
between these measures of personality and social status, it should be empha- 
sized that correlations of the magnitude of .70, with relatively large groups 
of Ss, are quite unusual in personality-oriented research. Therefore, when, as 
with the normal group in the present study, such a degree of association is 
obtained between the trait of resentment and the absence of mutuality in 
friendships, it seems that we are gaining understanding of highly eel 
relations between crucial variables in everyday social living. 

The results of this study also suggest that the personality traits of trust 
and distrust are fundamental in the formation of social relations. Stemming 
possibly from inadequate early mother-child relations, it seems that emo- 
tionally disturbed children are less trusting of people, and this factor makes 
it difficult for them to form lasting and mutually satisfying friendships. 
Another noteworthy findings is that the children who are popular and who 
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have friendships that are reciprocated tend to be high on the trait of optimism. 
It appears that the children tend to seek out and like to be with other children 
who are happy, contented, and confident. 

Viewing the over-all pattern of relations between the various personality 
traits and the indices of social interaction, it appears that Ss who are high 
on the negative traits and low on the positive traits are alienated from others 
in their group. The' present evidence, based on behavioral measures of every- 
day interpersonal relations, is in keeping with previous results obtained with 
direct and projective methods of personality assessment. More specifically, 
previous research with college students has led to formulation of what has 
been termed an “alienation syndrome” (5, 6, 7, 8). It was found empirically, 
employing an “Affect Questionnaire” (5), that Ss who were high on any one 
of the five negative traits tended to be high on all of them and also tended 
to be low on the three positive traits. Theoretical analyses of the specific 
traits that composed this syndrome suggested that Ss who presented this per- 
sonality picture must be “alienated.” In the present study, we find that the 
children who possess these traits are actually alienated from their social 
group. They tend to be unpopular and often are actively disliked, and even 
the people whom they seek out as friends do not show mutual attraction 
toward them. These Ss are, then, truly alienated, and the most disliked Ss 
are even more alienated than the ignored Ss. Thus, it seems that it is better 
to go unnoticed than to be actively rejected. However, this latter finding is 
based solely on the data obtained from the group of disturbed children 
which is, of course, a group that in general is very high on alienation. It 
would, therefore, be interesting in further research to discover if similar 
results would be obtained in normal groups. 

In conclusion, it seems that the over-all results can be understood and 
explained from the viewpoint of Horney’s theoretical notions (12) about 
neurotic conflicts and attempts at solution. It may be, in terms of Horney’s 
typologies, that the ignored Ss are the type who move away from others, the 
disliked Ss move against others, and the popular Ss move toward others. 
The relations found between the personality variables and the measures of 
interpersonal relations certainly suggest that this is actually the case. Thus, 
all in all, the present findings are quite encouraging and suggest that addi- 
tional research may well provide considerable further understanding of 
relations between individual psychopathology and group dynamics. With this 
increased knowledge, we may become better able to utilize both individual 
psychotherapy and social therapy in the attempt to make these disturbed 
children more capable of living peacefully with themselves and effectively 
with others. 
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SUMMARY 


The purpose of this research was: (a) to investigate relations between 
social choice and measures of personality in groups of normal children and 
emotionally disturbed children; (b) to compare the degree of mutual choice in 
the sociometric patterns within the disturbed and normal groups; and (c) to 
discover how accurately adults working with these groups of children could 
perceive their interpersonal relations. 

It was found, both in the disturbed group and the normal group, that 
social popularity was significantly associated with good emotional adjustment, 
possession of positive personality traits, and absence of negative traits. In 
the group of emotionally disturbed children, it was found that the disliked Ss 
tended to be more emotionally disturbed than were the ignored Ss. 

A significantly higher percentage of mutual choices was found within the 
normal group. Moreover, the mutuality of friendship scores were significantly 
correlated with good emotional adjustment and possession of desirable per- 
sonality traits. In both samples, highly significant correlations were obtained 
between the child’s “assigned popularity score,” based on the number of 
times he was mentioned by an adult as being the friend of another child, and 
the child’s “real popularity score,” based on the actual number of times he 
was chosen by other children. Some theoretical and practical implications of 
these finds were discussed. 
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A Configurational Approach to Prediction 


RosBert P. StucKertT, Ohio State University 


Current prediction studies are generally based upon methods that require 
the quantitative measurement of social factors. This approach rests on a series 
of assumptions often only vaguely recognized. The measurement of socio- 
logical concepts and factors imposes such restrictions as the equality of units, 
dimensionality, and linearity. The current emphasis on social measurement 
tends to obscure the point that “we may or may not wish to embody these 
assumptions in our theory; the point is that we almost always test our theory 
in terms of such measures, and hence in terms of such assumptions” (8, p. 5). 
The predictive power of a set of factors may be diminished if the interrela- 
tionships among these factors are distorted when such assumptions are 
imposed. 

The research discussed in this report centered around two problems: (a) to 
develop a prediction method that avoids the assumptions involved in social 
measurement; (b) to compare the predictive efficiency of this method with 
that of other commonly used methods. Although data pertaining to scholastic 
achievement at The Ohio State University were used, the primary purpose 
was to test alternative prediction methods rather than to construct an instru- 
ment for a specific application. These data were used because they provided 
for replication and were unusually complete and reliable. Prediction methods 
based on the theory of multiple linear regression (6, App. 5), and the prin- 
ciples developed by S. Glueck (2, Chap. 20) and E. W. Burgess (1) were the 
other methods tested in this research. 


THE METHOD OF PREDICTIVE CONFIGURATIONS 


The method of predictive configurations as developed in this research is a 
variation of the prediction-by-classification approach. It is designed to pre- 
dict a criterion with discrete categories from a set of discrete or continuous 
factors on the basis of the principle of maximum probability (3). Predictive 
accuracy is assumed to be a function of the homogeneity of the groups being 
studied as well as the number of variables used in predicting. To construct a 
prediction instrument, a sample is divided into relatively homogeneous sub- 
samples according to factors related to the predicted outcome. This increases 
the likelihood that each case included in a subsample has an equal chance of 
achieving a given outcome. Each subsample can be regarded as a separate 
unit for statistical analysis since it is defined by a unique configuration of 
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factors. The statistical values needed to predict are computed for each sub- 
sample rather than the total sample. 

In addition, the factors used in subdividing the sample are selected so that 
the probability of a given outcome occurring exceeds an arbitrary value, 
called the critical value, for each subsample. This results in the criterion 
being highly concentrated about some alternative outcome rather than dis- 
persed throughout its entire range. Perfect predictability can only be approxi- 
mated if a limited number of factors are used. A probability of a given out- 
come occurring that exceeds the critical value is assumed to be the best such 
approximation. The number of predictors is minimized by having each con- 
figuration include the fewest needed to exceed the critical value. Only con- 
figurations of factors with probabilities exceeding the critical value are used 
for predicting. In a given subsample, the criterion category associated with 
the largest probability is the predicted outcome for all cases in the subsample. 

Predictive configurations based on a given sample should satisfy three 
requirements. First, the number of configurations in an instrument should be 
conveniently small to prevent the system from being unwieldy. Second, the 
proportion of the sample that cannot be classified under one of the predictive 
configurations should be relatively small. Third, predictive configurations 
should be mutually exclusive. There can be only one outcome predicted for 
each individual in the sample. Conversely, the predicted outcome is the same 
for all individuals in each subsample. 


Operational Procedure 


The procedure for constructing a prediction instrument of this type is 
relatively simple. No complex statistical computations are necessary. Each 
step of the procedure and the way it is applied in constructing an instrument 
designed to predict scholastic success are described. 

The first step is to establish the critical probability value to be used in 
identifying predictive configurations. Ideally, a single value is used for all 
categories of the criterion. In general, the critical value should exceed the 
level of predictive accuracy desired by the investigator. This is not a hard and 
fast rule since it may not be possible to use a single critical value for all cri- 
terion categories in an empirical study. The number of categories in the 
criterion and the proportion of the total sample in each category are two 
factors influencing the selection of critical values. In the illustrative problem, 
the sample includes 500 freshmen entering the College of Commerce and Ad- 
ministration, The Ohio State University. In this sample, the probability of 
completing two years of course work successfully is .56; the probability of 
failing is 44. If the values .90 and .80 are chosen as critical values, the 
accuracy of the instrument should be approximately 85 per cent. Any con- 
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figuration of predictors associated with a probability of passing greater than 
.90 or a probability of failing greater than .80 is to be included in the pre- 
diction instrument. 

The second step is to select the initial predictive factor. An array of proba- 
bility values representing the distribution of the criterion is computed for 
each category of each factor (see Table 1). Any factor with one or more 
categories with probabilities exceeding the critical value may be selected. 
‘These categories constitute the first item of the instrument. The cases included 
in these categories are considered predictable cases and removed from the 
total sample. They are not utilized in any of the ensuing steps. If no factor 
meets this requirement, the one having the category with the largest proba- 
bility value is designated the initial predictor. 

Data included in the academic achievement problem indicate that an indi- 
vidual’s score on the Ohio State Psychological Examination, an indicator 
of general academic ability, is the factor most highly associated with academic 
achievement. The computed array of probability values and the number of 
cases in each category are given in Table 1. The probability of passing of indi- 
viduals with scores of 80 to 99 is .94. Since this exceeds the corresponding 
critical value (.90), this category of the Ohio State Psychological Examina- 
tion becomes the first predictive item. The fifty cases in this category are 
removed from the sample as predictable. 


The third step is to subdivide the remaining categories of the initial predictor 
by classifying the cases in them according to additional factors. The proba- 
bility of each outcome occurring is computed for the resulting subsamples. 
Adding factors produces increasingly homogeneous subsamples with two 
results: (a) these subsamples have probability values exhibiting a gradient 
pattern, and (b) the size of the subsamples becomes progressively smaller. 
The purpose of this procedure is to add the factor producing the greatest 


TABLE 1 


Probability Values of Categories of Ohio State Psychological Examination 


Ohio State 
Psychological 
Examination 


Probability 


| 
| 
: 
80-99 50 .94 .06 
= 60-79 80 .78 .23 
40-59 120 .34 
20-39 90 .49 
a 0-19 160 .30 .70 
Total 500 .44 
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shift in the magnitude of the probability values and the smallest decrease in 
subsample size. Cases defined by any configuration of factors with a proba- 
bility value greater than the critical value are considered predictable. Con- 
figurations of factors having probability values significantly less than the 
critical value are further refined. Factors are added until either the number 
of remaining cases becomes extremely small or the further division of a 
subsample does not result in significant changes in the probability values. 
At this point, the remaining cases are considered nonpredictable and placed 
in a residual class.! Factors which are effective in refining one configuration 
may have little effect on the probabilities associated with another. 

Although there is no exact procedure for selecting additional factors, four 
general rules are suggested as a starting point. It may become necessary to 
deviate from them when a relatively small proportion of the original sample 
remains to be analyzed. 

1. Select factors having categories with relatively high probability values. 
Add these factors in the order of their highest probability values. 

2. Add a factor only if the category with the largest probability value in- 
cludes a significant proportion of the remaining sample. Although this pro- 
portion depends on the size of the original sample, this research indicates that 
at least 20 per cent of the remaining sample should be in this category prior to 
adding the factor. 

3. Select factors exhibiting a high degree of heterogeneity for cases included 
in the “favorable” category of the criterion. Earlier prediction and diagnostic 
studies indicate that the ability of a factor to discriminate between criterion 
groups is a function of the degree to which the favorable or successful group 
is dispersed throughout its range. 

4. If several factors possess similar properties according to the above rules, 
select factors containing relatively few categories. This tends to minimize the 
number of possible configurations. 

To proceed with the academic success illustration, further refinement of 
the remaining categories of the initial factor, Ohio State Psychological Ex- 
amination, is necessary. Since 122 or 27 per cent of the 450 remaining cases 
fall in the “excellent” category of the factor Promise as a university student 
(as rated by high-school principal or teacher) and have a probability of 
passing of .75, it is selected for constructing additional configurations. For 
example, the cases in the “40 to 59” category of the initial factor are classified 


1 The occurrence of many of the cases in the nonpredictable class may be due to the 
use of relatively crude criterion categories or relatively unreliable indicators of the 
predictive factors. Theoretically, these cases may have no real existence in the population as 
defined by reliable indicators. 
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Ficure I. Probability values of configurations involving students with Ohio State 
Psychological Examination scores of 40 to 59 


Ohio State 
Psychological 
Examination 


Promise as a Excellent 
university (N 24) 
student -92 .08 


Best subject 
in high school 


Occupational 
preference 


Business/ 
Executive 
(N 41) 


-95 


05 


40-59 
120) 


-66 


-34 


Average 
or Poor 
(N 86) 
.44 


Social Science, Commercial 
Foreign Language, or 

Science 
(N 53) 


-83 


Vocational 
Subjects 
(N 33) 

.88 


Scientific/ 
Technical 
(N12) 
-42 -58 


Nore: Probability figures represent probability of passing and probability of failing 


respectively. 


according to this new factor. This classification is presented graphically in 
Figure I showing the number of cases and the proportions that passed and 
failed in each subsample. The probability of passing of the subsample de- 
fined by an Ohio State Psychological Examination score of “40 to 59” and a 
Promise as a university student of “excellent” or “good” is .91. This pair of 
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predictive factors is included as the second item of the instrument. The 34 
cases in this subsample are removed from the remaining sample. 

Neither probability value of the subsample defined by the other two cate- 
gories of Promise as a university student is sufficiently large for predictive 
purposes. Additional factors are tried as possible bases for dividing this sub- 
sample. Figure 1 indicates that adding the factors Best subject in high school 
and Occupational preference produces two additional predictive configura- 
tions. The first combines the Commercial Subjects and Vocational Subjects 
categories of the former factor with the two previously included predictors 
and has a probability of failing of .88. The second adds the Social Science, 
Foreign Language, and Mathematics categories of Best subject and the 
Business/Executive category of Occupational preference to the first two pre- 
dictors. This additional refinement raises the probability of passing from .83 
to .95. The remaining twelve cases are assigned to the residual class as non- 
predictable. The other categories of the initial predictor are refined in similar 
fashion. If the above factors have no appreciable effect on the probability 
values of the remaining categories, other factors must be tried. In this study, 
no single factor was included in every predictive configuration other than the 
initial predictor. Only one other factor, Rank in high-school class, was in- 
cluded in over 40 per cent of the configurations. 

The end product of this procedure is a series of subsamples defined by con- 
figurations of predictive factors each containing a minimal number of pre- 
dictors. A nonpredictable or residual class contains the cases that cannot be 
assigned to a predictive configuration. The derived set of configurations con- 
stitutes a prediction instrument for use with subsequent samples. The residual 
class may be used for predictive purposes if its largest probability is sig- 
nificantly different from .50. To predict for a new sample, each individual is 
assigned to either a predictive configuration or the residual class. The pre- 
dicted outcome for a given individual is the outcome with the largest proba- 
bility for the configuration to which he is assigned. For example, an individual 
whose academic outcome is unknown is selected from a group of new students. 
The following information is available in his records: 


Ohio State Psychological Examination Score 45 
Rank in high-school class Middle third 
Promise as a university student Average 

Occupational preference Social service 
. Social adjustment Good 
. Best subject in high school Industrial arts 


This student can be assigned to a predictive configuration on the basis of 
items 1, 3, and 6. Since the criterion category more highly associated with 
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this configuration is “failing,” the prediction for this student is that he will 
fail (see Figure 1). 


Formal Properties 


The method of predictive configurations differs from prediction methods 
based on measurement in several ways. First, the predictors and the criterion 
may be qualitative variables. The necessity of empirically quantifying social 
factors is minimized. Unlike measurement methods, there is no need to im- 
pute such characteristics as equality of units or dimensionality to all the 
factors used in predicting. The configuration method restricts the type of 
criterion that can be used. A continuous variable must be categorized and a 
discrete variable having more than three categories is unwieldy. 

Second, the method of predictive configurations emphasizes the use of 
distinct, homogeneous subsamples. Each subsample is highly associated with 
a specific outcome. This contrasts sharply with methods designed to differ- 
entiate among cases clustering about a modal value of the criterion in a 
single sample. It does not assume, as do prediction-by-measurement methods, 
that there exists a single set of predictors or a single set of factor weights 
that is best for all individuals in a given sample.” 

Third, it provides for the nonadditive, nonlinear interaction of factors. 
Since each subsample is defined by a unique configuration of predictive fac- 
tors, it is unnecessary to assume that the relationship between any factor 
and the outcome is constant for all individuals. The manner in which a factor 
differentiates among individuals with respect to the criterion in one predictive 
configuration may differ from that in another since its effect may depend on 
the other factors in the configurations. This differs from measurement methods 
that assume that instruments each measuring a single dimension with all other 
conditions held constant can be designed. The use of measurement techniques 
also assumes that the meaning of an individual’s score on an instrument is 
independent of his scores on other measures included in the prediction 
system. For example, if a measure of academic aptitude is found to be corre- 
lated with scholastic achievement, a given score is assumed to “contribute” 
equally to the degree of achievement of all persons having that score regardless 
of their other attributes. 

Fourth, the predictive factors need not be universal and general throughout 
the population. An individual is assigned to a predictive configuration only 
if he is characterized in a meaningful fashion by the factors defining the 
configuration. Prediction-by-measurement assumes that every member of the 
population is characterized to some degree by every factor. If measuring de- 


2 The implications of this assumption in terms of probability theory are discussed in (5). 


+ 


SOCIOMETRY 


vices can be constructed, every individual can be tested and given a numerical 
score for every factor in the system. 


DESIGN OF RESEARCH 


To determine the utility of the method of predictive configurations, instru- 
ments of this type were constructed and their effectiveness compared with 
that of other types of instruments. Data relating to the scholastic achievement 
of students in the College of Commerce and Administration, The Ohio State 
University, were used in this study. The criterion sample used in constructing 
the instruments consisted of 568 first-year students entering the college in 
the autumn of 1948. The accuracy and efficiency of these instruments were 
ascertained through the use of two validation samples. The first included 499 
freshmen entering the autumn of 1949; the second of 498 entering the follow- 
ing year. Since many types of factors may influence scholastic achievement, 
a relatively wide range was used. Mental and personality test scores, intel- 
lectual achievements, social and family background factors, teachers’ and 
counsellors’ ratings, academic and vocational interests and preferences, and 
initial college experiences were included. 

Two predictive configuration instruments were constructed and fully 
tested. The first was designed to predict a dichotomized criterion. The cumu- 
lative numerical grade earned by the student in all courses during his first 
six terms of academic work was used as the criterion of scholastic achievement. 
The cumulative point-hour ratio® of 1.75 was selected as the cutting point 
between “passing” and “failing.” This instrument consisted of 16 predictive 
configurations. In no instance were more than five predictive factors included 
in a configuration. Since the criterion was a continuous variable that had been 
dichotomized rather than a discrete variable with two alternatives, border- 
line cases having point-hour ratios near the cutting point were not used in 
constructing the instrument. In the 1948 criterion sample, only persons with 
point-hour ratios less than 1.50 or greater than 2.00 were included in the 
“failing” and “passing” categories respectively. The second instrument in- 
corporated a borderline category as part of a three-fold criterion. The cri- 
terion categories were “failing” (point-hour ratio of 0.00 to 1.49), “border- 
line” (1.50 to 1.99), and “passing” (2.00 and over). This instrument 
consisted of 16 predictive configurations each containing one to five factors. 


8 The cumulative point-hour ratio is equal to 
z= 
where n, = number of credit hours of a course 
p; = quality points received in the course 
At The Ohio State University, the number of quality points received by a student is 
determined by his grade in the course: A—4, B=3, C=2, D=1. 
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To test the relative effectiveness of these instruments, three additional 
instruments were constructed. These were based on the multiple linear re- 
gression (6, App. 5), the Glueck (2, Chap. 20) and the Burgess unit- 
weighting (1, 7) methods. They included 4, 5, and 21 predictors respectively.* 


RESULTS 


The evaluation of instruments based on the configuration method involved 
three problems. The first was whether or not these instruments would assign 
a large proportion of a sample to the residual or nonpredictable class. Their 
usefulness is inversely related to the size of this class. In this study, over 
80 per cent of each sample were classified under the predictive configurations 
included in each instrument. In the case of the instrument designed to pre- 
dict a dichotomized criterion, the relative size of the nonpredictable class did 
not increase appreciably in subsequent samples. Approximately 14 per cent 
of the criterion sample and 15 per cent of each validation sample fell in the 
residual class. A greater increase in the size of the residual class was noted 
in applying the instrument designed to predict a criterion with three alterna- 
tives. Approximately 11 per cent of the criterion sample and 18 per cent of 
the two validation samples were assigned to this class by this instrument. 

The second problem involved the effectiveness of configuration instruments 
relative to other types. The accuracy of an instrument is the proportion of the 
total sample predicted correctly through the use of the instrument. The 
relative efficiency of an instrument is the proportional reduction in the error 
of prediction achieved through its use. Since it is possible to predict with 
some degree of accuracy without an instrument, the use of one is justified 
only if it results in more accurate prediction. In this research, the proportion 
of a given sample included in the largest category of the criterion was used 
as the standard for computing the reduction in prediction error. Of the 
instruments for predicting a dichotomized criterion, the predictive configura- 
tion instrument yielded the most efficient predictions in the two validation 


*A linear regression equation including six factors was derived from the data for the 
1948 sample. This equation yielded slightly less accurate results than did the four-factor 
equation. 

5 Of the 568 students in the 1948 sample, 349 or 61.4 per cent passed. Without an instru- 
ment, the best prediction for an individual drawn at random would have been a “pass.” 
This procedure would have resulted in 349 correct and 219 incorrect predictions or an 
accuracy of .614. The use of the configuration instrument resulted in 443 correct and 125 
incorrect predictions: a decrease of 94 in the number of incorrect predictions. Hence, the 
instrument had a coefficient of efficiency of 94/219=.429. The proportions of the 1949 and 
1950 samples in the “pass” category were .639 and .656 respectively. When the criterion 
was a trichotomy, the “pass” category contained the largest proportion of the 1948, 1949, 
and 1950 ‘amples: .477, .506, and .523 respectively. 
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TABLE 2 


Coefficients of Accuracy and Efficiency of Alternative Prediction Methods, 
by Type of Criterion 


Accuracy Efficiency 


Prediction 1948 1949 1950 1948 1949 1950 
method sample sample sample sample sample sample 


Dichotomous Criterion 
Predictive configuration .804 
Multiple linear regression 
Glueck 


Burgess unit-weighting 


“wear 


* 


775 
.773 
748 * 


Trichotomous Criterion 
Predictive configuration .673 .615 .592 
Multiple linear regression .597*  .567 .608 


*Difference between this coefficient and corresponding coefficient of predictive con- 
figuration method is statistically significant at the .0S level determined by the proba- 
bility distribution of the difference of two proportions based upon large samples, two- 
tailed test. 


samples (see Table 2). In the 1949 sample, the efficiency of the configuration 
instrument was virtually equal to that of the multiple regression instrument 
and slightly superior to that of the Glueck instrument; the coefficients of 
efficiency were .324, .318, and .291 respectively. The Burgess-type instrument 
was the least efficient (.179). Only the difference between the configuration 
and Burgess instruments was statistically significant.® In the 1950 sample, 
the differences in efficiency were greater. The configuration instrument had a 
coefficient of .424 compared with coefficients between .267 and .345 for the 
other three instruments. The difference in efficiency between the configuration 
instrument and each of the others was statistically significant at the .05 level. 

Only the configuration and multiple linear regression methods were used 
to construct instruments for predicting a criterion with three categories. 
Neither instrument predicted this criterion as accurately and efficiently as it 
did the dichotomized criterion. In the 1949 sample, the configuration instru- 
ment was appreciably more efficient than the regression instrument. The 
coefficients of efficiency of the two instruments were .179 and .122 respectively. 
In the later sample, the configuration instrument was slightly less efficient 
than the other. The corresponding coefficients were .143 and .177; this dif- 
ference was not statistically significant at the .05 level. 


6 The significance of differences was determined through the use of the probability dis- 
tribution of the difference of two proportions based upon large samples, two-tailed test (4). 
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The third question concerned the utility of an instrument in predicting for 
the subsample in each criterion category rather than the total sample. Although 
one instrument may be less accurate than another in predicting for the entire 
sample, its ability te predict the outcome for certain types of individuals (such 
as academic “failures” in this study) may be superior to the other. Table 3 
gives the proportion of each criterion group predicted correctly by the instru- 
ments used in this study. A consistent pattern is to be noted in these data. 
The accuracy of the configuration method in predicting the outcome of cases 
in the “passing” category of the dichotomized criterion (.874) was only 
slightly superior to that of the linear regression (.861) and the Glueck (.867) 
instruments. In predicting for cases in the “failing” category, the first instru- 
ment was more accurate than the others. The configuration instrument pre- 
dicted .701 of the cases in this category correctly. The regression instrument 
was second with .662 of the resulting predictions being correct. The other two 
instruments were significantly less effective in this respect. With a trichotomous 
criterion, the multiple linear regression instrument was more accurate than the 
configuration instrument in predicting for cases in the extreme categories of 
“passing” (.829 vs. .808) and “failing” (.556 vs. .481). The configuration 
instrument was far superior in predicting “borderline” status. It should be 
noted, however, that neither method was able to predict correctly the majority 
of cases in this intermediate category: the proportions being .401 and .183 for 
the configuration and regression instruments respectively. 


TABLE 3 


Proportion of Cases in Each Criterion Category of Validation Samples Predicted 
Correctly by Alternative Prediction Methods 


Dichotomous criterion 
Prediction 


method 


Failing 


Predictive configuration 
Multiple linear regression 
Glueck 

Burgess unit-weighting 


.662 
-619 * 
.594 * 


Borderline Failing 


Predictive configuration .808 
Multiple linear regression .829 


-481 
556 


figuration method is statistically significant at the .05 level determined by the probability 
distribution of the difference of two proportions based upon large samples, two-tailed test. 
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Earlier prediction studies have indicated that the cases in categories other 
than the highly successful one are the most difficult to predict. The ex- 
planation most frequently advanced is that these cases represent more hetero- 
geneous groups of individuals. The superiority of predictive configuration 
instruments in this respect may be due to their being based on the use of 
samples of greater homogeneity than those incorporated in measurement 
methods. It is this superiority that accounts for much of the greater over-all 
accuracy of these instruments. 


SUMMARY 


This analysis has established the effectiveness of the method of predictive 
configurations, a nonmetric method, in constructing instruments for predicting 
scholastic success. Predictions resulting from the use of these instruments 
were compared with those obtained from instruments based on the multiple 
linear regression, Burgess unit-weighting, and Glueck methods. The instru- 
ments based on the method of predictive configurations were equal to or 
superior to the others in two respects. First, they were easier to use in pre- 
dicting for subsequent samples. Neither quantitative factor coefficients nor 
numerical category weights were needed to predict for an individual case. 
Second, the predictive configuration instruments were more accurate and 
efficient in predicting a criterion with either two or three alternatives. Of the 
other methods tested, multiple linear regression yielded predictions most 
nearly approaching those of the configuration instruments in accuracy. The 
superiority of the configuration instruments was largely the result of their 
greater ability to predict certain categories of the criterion. They tended to 
be more accurate in predicting the criterion category containing the more 
heterogeneous portion of the sample. The characteristics of the method of 
predictive configurations should make it especially useful wherever individual 
behavior or adjustment results from a complex of interacting factors. 
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Self-Perceptions Regarding Masculinity-Femininity and 
Descriptions of Same and Opposite Sex 
Sociometric Choices 


RicHarp M. Lunpy, University of Wisconsin 


One of the more persistent problems in the area of sociometric choices is 
the study of the personal characteristics of persons chosen as best liked or as 
least liked. An approach to this study is to consider how these best: or Jeast 
liked people are perceived by the individuals doing the choosing. In this ap- 
proach the concern is not with the characteristics of the perceived but with 
the ideas or conceptions that the perceiver has of him. 

Fiedler (2) has described an aspect of the perception of other people 
called assumed similarity. In studies concerned with this aspect, the interest 
is focused upon the degree of similarity between A’s perception of B and A’s 
perception of himself. This may be distinguished from empathy or accuracy 
of perception studies which measure the degree of similarity between A’s 
perception of B and B’s perception of himself. In the present study we «all 
deal with the assumed similarity—here called ascribed similarity—betwec . a 
person and his sociometric choices. 

In a previous study (6) dealing with descriptions of opposite sex sociometric 
choices, a person’s self-perception was found to be more similar to his descrip- 
tions of a positive sociometric choice than to his description of a negative 
sociometric choice. Further, persons tended to describe their positive socio- 
metric choices as more similar to acceptable self-descriptions than to un- 
acceptable self-descriptions, and to describe their negative choices as more 
similar to unacceptable than to acceptable self-descriptions. 

Because earlier studies (3, 7) had demonstrated these same findings in 
descriptions of same sex sociometric choices, the conclusion may be drawn 
that these statements of relationships between self-perceptions and descrip- 
tions of sociometric choices are general principles which hold whether the 
sociometric choices are of the same or the opposite sex. 

This conclusion does not, however, fit in well with the generally accepted 
notion of complementarity, which states that persons prefer members of the 
opposite sex in whom they see aspects which complement (may be opposite 
to) their own personality. Complementarity is most clearly exemplified in 
the area of masculinity-femininity and may be represented by the preference 
of the very masculine male for the very feminine female. If this example 
represents general trends, we would expect the previously stated principle 
regarding sociometric choice and perceived similarity to self to be reversed; 
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so that the positively chosen female would be perceived by the male as dif- 
ferent from himself regarding masculinity while the negatively chosen female 
may be perceived as s'milar to himself. In the same way the male may ascribe 
the unacceptable (feminine) aspects of himself to a positive sociometric 
female and ascribe his acceptable qualities (masculinity) to a less positive 
female sociometric choice. 

It becomes apparent that two contradictory notions may be proposed in 
the area of masculinity-femininity, regarding se!f-perceptions and descrip- 
tions of opposite sex sociometric choices. The first notion states that we gen- 
erally prefer opposite sex members who are perceived similar to us, even in 
the area of masculinity-femininity. The second notion states that we generally 
prefer opposite sex members who are perceived different from us in the area 
of masculinity-femininity. 

An attempt to resolve this contradiction, however, should also consider the 
possible effects of the perceiver’s own sex identification. We may find that the 
results will vary depending upon the degree to which the male perceiver is 
masculine and the female perceiver is feminine. 

If the area of interest is masculinity-femininity we should also provide an 
opportunity to compare the differences, if any, between perceptions of the 
same sex and perceptions of the opposite sex. On the basis of the earlier 
studies (6, 7) we might expect no differences between same and opposite sex 
choices. Yet from a very general projection hypothesis we would expect males 
_to perceive more similarity between themselves and other males than between 
themselves and females, especially in the area of masculinity-femininity. 
Similarly, we would expect females to perceive more similarity between them- 
selves and other females than between themselves and males. 

This study, then, is generally concerned with the ascribed similarity in the 
area of masculinity-femininity between self-perceptions, both total and ac- 
ceptable aspects, and perceptions of others. The three variables specifically 
concerned are: (a) whether the other person is liked or disliked, (b) whether 
the other person is of the same or opposite sex, and (c) whether or not the 
perceiver is strongly identified with his own sex. 


PROCEDURE 
Subjects 

The Ss were 48 male and 88 female students in an intermediate course in 
psychology. 
Masculinity-Femininity Description Blank 


The M-F Scale of the Minnesota Multiphasic Personality Inventory was 
selected as the instrument to measure self-perceptions and descriptions of 


j 
: 
| 
4 
4 
Bi 
: 


240 SOCIOMETRY 


others in the area of masculinity-femininity. The 6C numbered statements 
from the M-F scale were presented on separate sheets of paper followed by 
three sets of instructions and three IBM answer sheets for each S. Two sets 
of instructions were the same for every S. The first of these told the S to 
respond True to a statement if it applied to him and False if the statement 
did not apply. The second set of instructions asked the S$ to decide whether 
or not each statement would characterize him as he would most like to be. 
These two instructions and the descriptions are called Self and Ideal respec- 
tively. The third set of instructions concerned the description of another per- 
son. The S was requested to describe a positive sociometric choice of the same 
sex as themselves, a positive choice of the opposite sex, a negative choice of the 
same sex, or a negative choice of the opposite sex. 

The same sex, positive choice instructions read as follows: “You are to 
consider each statement separately and decide whether or not it would charac- 
terize the student here at school with whom you most like to spend your time. 
Select a person of your own sex whom you know fairly well.” The words, 
“least like” and “the opposite sex’”’ were exchanged for “most like” and “your 
own sex” to make the other combinations of instructions. 

The order of presentation of the three sets of instructions (Self, Ideal, 
Other) was systematically varied to control for possible sequence effects. 
The instruction sheet presented first also requested each S to respond to every 
statement, to answer as truthfully as possible, to keep the questionnaire 
anonymous, and to indicate the sex of the S. 


Data Analysis 


The self-responses to the list of statements were scored first and the scores 
for males and for females were distributed so that both sexes could be 
divided into high and low M-F Score Ss. Some Ss were randomly removed 
from the data until equal cell entries were obtained in a 2 x 2 x 2 factorial 
design with variables: (a) positive or negative sociometric choice, (b) same 
or opposite sex chgice, and (c) high or low sex identification. (The high 
identifiers were those males with low M-F scores and those females with high 
M-F scores). The resulting Ns for the eight cells were 6 for the males and 11 
for the females. 

Three scores were obtained from each S—Total Ascribed Similarity, Ac- 
ceptable Ascribed Similarity, and Unacceptable Ascribed Similarity. Total 
Ascribed Similarity includes the number of items to which the S responds 
the same for the Self and for the Other. The possible range of scores was 
from 0 to 60. 

The second score represents an attempt to measure the degree to which 
a person ascribes his acceptable qualities to another person. Acceptability 
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was defined as the degree of similarity between Self and Ideal descriptions; 
thus the Total Acceptability score is a count of those items to which the S 
responded the same for Self and for Ideal. Because the Total Acceptability 
score varied among Ss, the number of times the S ascribed these acceptable 
items to the Other was divided by Total Acceptability to give the Acceptable 
Ascribed Similarity score. 

In the same fashion the. Unaccepted Ascribed Similarity score was obtained. 
The Total Unacceptability score represented those items to which the S 
responded differently for Self and Ideal. The number of times the S ascribed 
these unacceptable items to the Other was divided by the Total Unaccepta- 
bility score to make up the Unacceptable Ascribed Similarity score. 


RESULTS 


Table 1 presents the analysis of the male Total Ascribed Similarity Scores. 
The Sociometric Choices main effect is the only significant finding. In this 
difference the positive sociometric choice receives significantly more ascribed 
similarity than the negative sociometric choice receives. 

The analysis of the similar data for female Ss also finds Sociometric Choice 
significant. In addition the interaction between Sex Choices and Sex Identi- 
fication reaches significance. In this interaction the Ss with high Sex Identifica- 
tion (high femininity for female Ss) have higher Ascribed Similarity scores 
when they perceive Same Sex Choices while the Ss with low Sex Identification 
have higher Ascribed Similarity scores when they perceive Opposite Sex 
Choices. 


TABLE 1 


Analysis of Variance of the Total Ascribed Similarity Scores for Males 


Sums of Mean 
Source squares square 


Sociometric choices 645.34 645.34 
Sex choices 33.34 33.34 
Sex identification 30.11 30.11 
Interaction: sociometric 

choice x sex choices 33.31 33.31 
Interaction: sociometric 

choice x sex identification 14.04 14.04 
Interaction: sex choices x 

sex identification 70.04 70.04 
Triple interaction 2.18 2.18 
Error 2621.31 65.53 


* Significant beyond the .05 level. 
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TABLE 2 


Analysis of Variance of the Total Ascribed Similarity Scores for Females 


; Sums of Mean 
Source squares square 


Sociometric choices 1057. 
Sex choices 
Sex identification Ss 29. 
Interaction: sociometric 
choice x sex choices 
Interaction: sociometric 
choice x sex identification 21. 
Interaction: sex choices x 
sex identification 189.11 
Triple interaction 20.96 
Error 3610.57 


* Significant beyond the .05 level. 


When the male and female data are combined, both the Sociometric Choices 
main effect and the Sex Choices by Sex Identification interaction effect remain 
significant. 


Tables 3 and 4 present the analyses of the Acceptable Ascribed Similarity 


scores for males and for females. In each case the only significant difference 
is between the Sociometric Choices; the positive choice receives more 
Acceptable Ascribed Similarity than the negative choice receives. 


TABLE 3 
Analysis of Variance of the Acceptable Ascribed Similarity Scores for Males 


Sums of Mean 


Source squares df square 


Sociometric choices 3906. 1 3906 .02 
Sex choices 7.3 1 17.5 
Sex identification 1 725 
Interaction: sociometric 

choice x sex choices 
Interaction: sociometric 

choice x sex identification 
Interaction: sex choices x 

sex identification 426. 
Triple interaction 
Error 11817. 


* Significant beyond the .05 level. 


242 

+ 

— 

09 23.42 * 

01 

Bip)’ 30 

03 : 

11 4.19* 

96 — 

13 oe 

5 

at 

{ 

F 

13.22 * 

— 

: 

1.44 


MASCULINITY-FEMININITY AND SOCIOMETRIC CHOICES — 243 


TABLE 4 
Analysis of Variance of the Acceptable Ascribed Similarity Scores for Females 


Sums of Mean 
Source squares square 
1 


Sociometric choices 5744.56 5744.56 
Sex choices .92 
Sex identification 23.01 23.01 
Interaction: sociometric 

choice x sex choices 36.92 36.92 
Interaction: sociometric 

choice x sex identification 280.10 280.10 
Interaction: sex choices x 

sex identification 94.10 94.10 
Triple interaction 24.02 24.02 
Error 13722.45 171.53 


* Significant beyond the .05 level. 


The analysis of the Unacceptable Ascribed Similarity scores finds only one 
significant F. This result, presented in Table 5, is the interaction between 
Sex Choices and Sex Identification in the female data. The relationship is the 
same as that found in Table 2; i.e., Ss with high Sex Identification perceiving 


same Sex Choices and Ss with low Sex Identification perceiving opposite Sex 
Choices tend to have higher Unacceptable Ascribed Similarity scores. No 
significant findings are revealed in the male data. 


TABLE 5 


Analysis of Variance of the Unacceptable Ascribed Similarity Scores for Females 


Sums of Mean 
Source squares square 


Sociometric choices 113.63 113.63 
Sex choices 58.91 58.91 
Sex identification 122.91 122.91 
Interaction: sociometric 

choice x sex choices .43 
Interaction: sociometric 

choice x sex identification 84.06 
Interaction: sex choices x 

sex identification 3049.15 
Triple interaction 327.12 
Error 29734.11 


* Significant beyond the .05 level. 
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DISCUSSION 


The first question to discuss is whether or not the M-F scale does, in fact, 
serve to distinguish between males and females in our sample. An examination 
of the scores for the self revealed very little overlap between males and 
females. The t between the male and female means, presented in Table 6, 
being quite substantial (7.1), we must conclude that the males and females 
were responding quite differently to the test. 

The relationship between positive or negative sociometric choice and the 
ascribed similarity measures is quite definite. The positive choice is perceived 
as more similar to the self and as more similar to the accepted aspects of the 
self. This is true for both male and female subjects. This finding, plus those 
from other studies (6, 7), suggests that the process of ascribing similarities, 
both of one’s total self-picture and of only the acceptable aspects of the self, 
distinguishes our perception of liked people from that of unliked people. And 
this is the case whether the liked and unliked people are the same or opposite 
sex and whether the personality characteristics perceived are more general or 
relate specifically to the masculinity-femininity dimension. 

Since the acceptable aspects of the self are operationally defined as those 
self qualities which are also ideal, the interpretation of these results could very 
well be that the positive sociometric choice is idealized in the Ss’ perceptions 
of him. In this sense the liking of a person results in one perceiving him as 
ideal, correct, or perfect. Berkowitz (1) has demonstrated that the liking of 
group members increases the perception of the merit of the members’ 
opinions. In the same way the studies by Horowitz, et al. (4) and Jordan (5) 
suggest that the liking of a person increases the likelihood of a positive evalua- 
tion of his behavior. 

The striking feature of the present results is that, interpreted from this 
frame of reference, the liked opposite-sex positive choice is idealized in terms 
of same-sex qualities. That is, a male S ascribes more idealized masculine 
qualities to a liked female than to an unliked female, in the same way that 
he ascribes more of these qualities to a liked male than to an unliked male. 


TABLE 6 
Male and Female Means on the M-F Scale of the MMPI 


Male Female Difference 


7.82 


* Significant beyond the .05 level. 
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Complemental effects are not apparent in these perceptual data. From a 
complemental point of view we should expect the best liked opposite-sex 
person to be perceived as more different from the self and from the acceptable 
aspects of the self. Instead we find that the least liked opposite-sex person is 
perceived as more different. Yet these results do not directly comment on the 
complementarity of needs hypothesis of mate selection as developed by 
Winch (9). In Winch’s hypothesis, that dissimilar needs are a basis for 
strength and “equilibrium” in a marriage, these needs may or may not be 
conscious or perceived in the partner. In fact the varying degree of 
consciousness of the need is a source of distortion in the relationship. 

The present results would suggest that complementarity, if it exists in the 
area of masculinity-femininity, is not a measurable thing from a perceptual 
point of view. People certainly do not perceive liked opposite-sex members as 
complementing their masculinity or femininity; they perceive those members 
as being similar to them in this regard. If similarities between opposite-sex 
partners also lead to the reinforcement and strengthening of their relationship, 
as suggested by Rosow (8) and many others, then the strength of the rela- 
tionship that the partners perceive is the aspect of similarity and not the 
aspect of complementarity. 

The general process of ascribing similarities between oneself and one’s own 
sex gains little support from these data. Since the analysis of the results shows 
no differences between same and opposite sex perceptions and no interactions 
between sex of choice and sociometric choice, we may conclude from these 
data that the individual S does not perceive positive or negative members of 
his own sex, in contrast to the opposite sex, as more similar to himself or to 
those acceptable aspects of himself. And this in spite of the fact that we are 
dealing with material in the masculinity-femininity area and that males and 
females do differ in respect to their self-perceptions. 

We do have some evidence, from Tables 2 and 5, however, that points 
toward a very general process for the female high sex identifiers, i.e., the 
“feminine” females, to ascribe more similarity between themselves and their 
own sex than between themselves and the opposite sex. That this is a general 
ascribing of similarities, or assimilative projection as it has been called else- 
where (6), is noted in the fact that this direction of results obtains for accept- 
able, unacceptable, and total self-perceptions, the latter two being statistically 
significant. 


SUMMARY 


One hundred thirty-six male and female college students were administered 
the 60 items of the M-F scale of the MMPI. Each S’s responses for Self, Ideal, 
and Other (either Positive-Same Sex, Negative-Same Sex, Positive-Opposite 
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Sex, or Negative-Opposite Sex Sociometric choices) were scored to arrive at 
Total Ascribed Similarity, Acceptable Ascribed Similarity, and Unacceptable 
Ascribed Similarity scores. 

The results strongly support the conclusion that even in the masculinity- 
femininity dimension the liking of a person, whether same or opposite sex, 
is directly related to the degree to which he is perceived as similar to oneself 
and to one’s acceptable aspects of the self. Thus the notion of perceptual 
complementarity is not supported. 

Although males and females respond differently to the scale, there are no 
differences for the ascribed similarity measures between the same and opposite 
sex. Female high sex identifiers, however, tend to ascribe more similarity to 
same than to opposite sex members. 


Manuscript received: January 20, 1958 
Revised manuscript received: April 7, 1958 
Richard M. Lundy 
Department of Psychology 
University of Wisconsin 
Madison 6, Wisconsin 
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Cultural Norms and Husbands’ and Wives’ Reports of Their 
Marital Partners’ Preferred Frequency of Coitus 
Relative to Their Own ' 


Paut WALLIN, Stanford University 
ALEXANDER CLARK, Stanford University 


A major component of the ideology of sex in American society of the past 
was the belief that the male, by nature, had a more imperative and constant 
need than the female for sexual satisfaction.* This belief made it normative 
for the sexes, before and after marriage, to conceive of men as desiring and 
requiring sexual intercourse more often than women. In marriage, women were 
expected to defer to the greater desire of men as a matter of wifely duty. 

As a consequence of the norm the sense of masculinity of husbands rested 
in part on their being able to conceive of their sex drive as stronger than 
that of their wives. The latter, in turn, could conceive of themselves as appro- 
priately feminine partly by being able to impute to themselves a lesser desire 
for sexual, gratification. In any given marriage, therefore, a condition of the 
maintenance of husband’s and wife’s self-esteem was that both perceive the 
former as having the stronger sex drive. 

It is a reasonable inference that this cultural definition of the difference 
in sex drive between men and women has been changing—at least in the 
middle class—to a conception of the sexes as more or less equal in their desire 
for sexual satisfaction. Some such change was to be expected in view of the 
growing equalization of the status of men and women, the increasing emphasis 
on wome’s right to sexual gratification, and the considerable diffusion in the 
past three or four decades of the belief that sexual compatibility was important 
for marital success. The sexual difference between men and women postu- 
lated before the turn of the century could hardly be sustained in juxtaposition 
with the latter themes. 

Impressionistic evidence supports the assumption that the current norm 


1This is one of a group of articles reporting the findings of a study dealing with 
questions relating to the sexual behavior of a group of 604 men and women before and 
after marriage. Acknowledgment is due for support given the study by the Rockefeller 
Foundation and the Social Science Research Council. The research was also supported in 
part by a grant from the Committee for Research in Problems of Sex, National Academy 
of Sciences—National Research Council. The senior author is indebted to Ernest W. 
Burgess with whom he collaborated on the long-term investigation of engagement and 
marriage in the course of which the data of the sex study were collected. For their work 
on various phases of the study thanks are due to Fred Chino, Jan Howard, and Don 
Mills. The present article profited considerably from comments by Theodore Caplow. 

2 The research reported in this article is not concerned with whether in fact there are 
biologically determined differences in the sex drive of men and women. 
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in regard to the sex drive of men relative to women is that of equality, but it 
also supports the assumption of persistence of the older norm in some degree 
in that it presumably remains acceptable for husbands but unacceptable for 
wives to have the stronger drive. Modern husbands tend to be thought of as 
welcoming an equality of sex drive in their wives. But given inequality in 
marital partners it is supposed that while husbands may feel frustrated if 
their wives have a lesser sexual desire, they feel threatened if their wives 
have a stronger desire. This supposition implies that the male’s sense of 
masculinity continues to be thought of as linked to his sexual performance, 
but as resting now on his perception of himself as being at least equal in sex 
drive to the female. In the case of wives, it is supposed that they are pleased 
if their sex desire equals that of their husbands, and, depending on their 
general marital satisfaction, more or less tolerant of the desire of their hus- 
bands if in excess of their own. If, however, the desire of wives exceeds that 
of their husbands, the supposition is that wives tend to suffer some strain, 
born of a situation which, at one and the same time, presents them with a 
culturally defined image of their husbands as “undersexed” and therefore 
somewhat less masculine than they should be, and of themselves as ‘“oversexed”’ 
and consequently somewhat less feminine than they should be. 

Insofar as the assumption advanced above regarding the contemporary 
norm for the relative sex drives of men and women are correct, data collected 


from married couples should reveal (a) the acceptability to marital partners 
of an equally strong sex drive in men and women, (b) the acceptability of a 
stronger sex drive in men, and (c) the unacceptability to husbands and wives 
of a weaker drive in the former. The research here reported presents evidence 
consistent with these propositions, and consequently offers empirical support 
for the hypothesis that the norm in question is operative in urban middle- 
class American society, from which most of the study couples were drawn. 


SAMPLE AND DATA 


The data for the present research were collected as part of a larger project 
described elsewhere in detail (1). In its first phase the project was concerned 
with 1,000 engaged couples. Approximately 65 per cent (666) of the couples 
were restudied after a few years of marriage. At that time, 604 husbands and 
wives, independently of one another and under supervision, filled out ques- 
tionnaires that yielded the data used for the study reported here.* 

The couples were volunteers and received no payment. They were white, 
native-born, and predominantly residents of metropolitan Chicago. Only a 

3 Of the 666 couples restudied 62 filled out an abbreviated form in which questions re- 


lating to sex were omitted. For reasons footnoted in Table 2 the present study is based on 
459 of the 604 couples from whom the sex data were solicited. 
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fifth of the men and about a third of the women lacked some college educa- 
tion. About a fourth of the former and a tenth of the latter were at the gradu- 
ate or professional level of education. Somewhat more than half of the 
couples were Protestant, the others being Jewish or Catholic or having no 
religious affiliation. The average age at marriage was 25.7 years for the men 
and 23.9 for the women. 

When restudied, about three fourths of the couples had been married be- 
tween three and five years. A small percentage had been married under three 
years and the marriages of the remainder were of six to nine years duration. 
The findings presented below are thus for the early years of marriage.* Data 
are currently being collected from these couples in the middle years of 
marriage. 

The present study required data for grouping couples—as described in the 
next section—in terms of equality of, or difference between, marital partners 
in sex drives. The measure of sex drive used was reports by couple members 
of their preferred monthly frequency of coitus. 

The study further required a criterion of the acceptability to husbands and 
wives of the strength of their marital partners’ sex drive relative to their 
own. The criterion employed was the congruence between (a) a couple mem- 
ber’s characterization of the spouse’s sex drive as equal to or higher or lower 
than his own as inferred from his report, both of his own and the spouse’s 
preferred monthly frequency, and (b) the relative sex drive of the given 
couple member and his spouse as inferred from the reports by each of them of 
their own frequency preference. This criterion will not be elaborated at this 
point since it can be more readily understood from its use as indicated in the 
analysis of the data. 

The data used for the purposes described above were the responses of 
husbands and wives to the following questions: 


1. About how many times per month would you prefer to have sexual 
intercourse? 


About how many times per month do you think your wife (husband) 
would prefer to have sexual intercourse? 


The utilization of responses to these questions in the present investigation 
was based on the assumption that neither men nor women would be con- 
strained by a cultural norm to report their own preferred frequency as other 
than what it was. This followed from the presumption that both sexes have 
no culturally given conception of what constitutes an appropriate preferred 
frequency for members of their sex.’ What is culturally defined—as hypothe- 


4 This phase of the study was completed in 1946. 
5 The data of the present research were collected before publication of the Kinsey 
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sized and tested in this investigation—is the proper preferred frequencies of 
the sexes relative to one another. 

Assuming that husbands and wives are “culture free” to report their own 
preferred frequency the problem remains of how accurately they can do so. 
Unfortunately, no evidence is available on the accuracy or consistency of these 
reports. The use of what would seem to be an obvious check on the former, 
namely the comparison of the preference reported by one partner with the 
spouse’s estimate of that preference, is ruled out by the finding of the present 
research that some marital partners are subject to systematic bias in reporting 
their spouses’ preferred frequency of coitus. 


ANALYSIS AND FINDINGS 


Couples were divided into the following three classes by the reports of 
husbands and wives of their own preferred frequency of intercourse: 


Class A: Couples in which husband and wife had the same preference 

Class B: Couples in which the husband’s preference was higher than the 
wife’s 

Class C: Those in which the wife had the higher preference 


In order to allow for a margin of chance error in self-reports of their average 
monthly preferred frequency, husbands and wives who differed by one or two 
in their indicated monthly preferences were placed in Class A, along with 
those marital partners who were in perfect agreement. Couple members who 
differed by three or more in their preferred monthly coital rate were placed in 
Class B or C, depending on whether it was the husband or wife who had the 
higher preferred frequency. The percentages of couples in the three classes 
were respectively 51, 32.4, and 16.6. 

According to the hypothesis of the study, the husbands and wives com- 
prising the Class C couples should find unacceptable, and those in Class A 
and B should find acceptable, the sex drive of their marital partners relative to 
their own. Class C couples are those in which by self reports of husbands and 
wives the latter have the stronger sex drive. Thus, these couples do not con- 
form to the norm—the operation of which is here hypothesized—that marital 
partners be equal in sex drive or that, if they differ, the husband should have 
the stronger drive. Consequently, assuming that persons are motivated to 
perceive or represent themselves to others as satisfying cultural norms— 
especially those relating to masculinity and femininity—if the norm is indeed 
operative, it would be predicted that Class C husbands and wives would give 
evidence of finding their situation unacceptable by representing themselves 


statistics on the frequency of marital coitus among various subgroups in American society. 
It is possible that some persons now derive a conception of the preferred frequency 
appropriate for them from these statistics on actual frequency of marital coitus. 


| 

| 

| 

| 

| 

poly 

| 

| 


NORMS AND REPORTS OF MARITAL PARTNERS 


TABLE 1 


Mean Monthly Preferred Frequency of Coitus of Husbands and Wives by Self-reports 
and Estimates of Marital Partners for Couples Grouped into Three Classes * 


Class A Class B Class C 


Husband and wife Husband's pref- Wife's pref- 
have same preference * erence higher 7 erence higher *¢ 


(N = 234) (N = 149) (N = 76) 


Mean of husbands’ reports of 


their own preferred frequency 7 = 3. 6.49 o=3. 
Mean of husbands’ estimates of 

wives’ preferred frequency “aa .0! =5.8 6.88 6 = 4.04 
Mean of wives’ reports of their 

own preferred frequency 7 6.24 11.67 o= 4.91 
Mean of wives’ estimate of 

husbands’ preferred frequency 9.00 o— 4.74 10.62 ‘ 11.79 6 =4.89 


* The total of the three classes is considerably less than that of the total sample (604) 
because it excludes couples in which either husband or wife failed to report their own 
preferred frequency and that of the marital partner. Additional couples were excluded 
from this analysis because husband or wife responded qualitatively (e.g., often) or by 
specifying a broad range of frequency (e.g., 10 to 20).° 

+ By husbands’ and wives’ reports of their own frequencies. Marital partners are classi- 
fied as having the same preference if they did not differ by more than 2. Husbands (and 
wives) are classified as higher if they exceeded their spouses’ by 3 or more. 


as approximating the norm more closely than in fact they do. The subse- 
quent analysis shows this to be so. 

Table 1 gives four means for each of the three classes of couples. These 
are: (a) the mean of husbands’ reports of their own preference, (b) the mean 
of husbands’ estimates of their wives’ preference, (c) the mean of wives’ 
reports of their own preference, and (d) the mean of wives’ estimates of their 
husbands’ preference. 

Class C husbands, on the average, report for themselves a preferred 
monthly frequency of 6.49 and impute a preference of 6.88 to their wives. 
The wives report a preference of 11.67 per month for themselves, and, on the 
average, impute a preference of 11.79 to their husbands. Thus Class C persons 
tend to report their own and their marital partners’ preferred frequencies as 
alike, whereas the reports by husbands and wives of their own preferences 
represent them as unlike, with the preferred frequency of wives almost twice 
that of the husbands.® 


6 The difference between the mean frequency of the self-reports of husbands and that 
of wives is significant at the .01 level, using the formula for correlated means; r = .79. 
The means in Table 1 are all computed from distributions that are approximately normal. 
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Here and subsequently the assumption is made that, after three to five 
years of marriage, persons have a basis for approximate knowledge of the 
preferred frequency of coitus of their marital partners. It seems unlikely, for 
example, that Class C wives would not be aware of the fact that their hus- 
bands’ preferred frequency is substantially lower than their own. Nor does 
it seem likely that these husbands would not be cognizant of the fact that 
their wives’ preferred rate of intercourse was considerably in excess of their 
own, even though the wives were discreet about their greater preference. 

Class B persons “behave” quite differently from those in Class C. By 
self-reports of husbands and wives, Class B couples conform to the hypoth- 
esized norm that if marital partners differ in sex drive it should be the hus- 
band who has the stronger desire. And in Class B it is to be noted that hus- 
bands, on the average, report a substantially higher preference for themselves 
than they impute to their wives.‘ The wives, on the other hand, report a sub- 
stantially lower preference for themselves than they impute to their husbands.* 

By self-reports of husbands and wives, Class A couples also satisfy the 
hypothesized norm with couple members reporting approximately the same 
drive. And in this class the average of husbands’ and wives’ reports of their 
own preferred frequency correspond to the average frequency which they im- 
pute to their marital partners, the correspondence being much closer for the 
men than for the women.’ 

It remains to be shown, however, that the indicated difference between 
Class C persons and those in Classes A and B inferred from the comparison 
of group averages also obtains when the data are analyzed in terms of indi- 
viduals. That such is the case is demonstrated below. 

The following percentages were calculated for the three classes of couples. 
Class A, 1. the percentage of cases in which the husbands’ estimates of their 
wives’ preferred monthly frequency did not differ from the husbands’ reports 
of their own preference by more than two; 2. the percentage of cases in 
which the wives’ estimates of their husbands’ preferences did not differ from 
the wives’ reports of their own preference by more than two. Class B, 1. the 
percentage of cases in which the husbands imputed to their wives a pre- 
ferred frequency lower than their own by three or more; 2. the percentage of 
cases in which the wives estimated their husbands’ preferred frequency as 

7 Significant at .01 level; r= .48. 

8 Significant at .01 level; r= .41. 

® The difference between the mean of wives’ reports of their own preferred frequency 
(7.39) and the mean of their estimates of their husbands’ preference (9.00) is sig- 
nificant at the .01 level; r= .71. Since this difference is not consistent with the relative 
size of the means of husbands’ and wives’ self-reports it suggests that Class A wives tend 


to think of, or prefer to represent, their husbands’ sex drive as slightly stronger than 
their own. 
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NORMS AND REPORTS OF MARITAL PARTNERS 


TABLE 2 


Confirmation Percentages Based on Husbands’ and Wives’ Reports for 
Three Classes of Couples 


Class A Class B Class C 
Husband and wife Husband's pref- Wife’s pref- 
have same preference * erence higher * erence higher * 
(N = 234) (N = 149) (N = 76) 
Husband’s report of own and esti- 
mate of his wile’s preference 77.8 52.3 14.5 
: Wife’s report of own and estimate 
3 of her husband’s preference 71.4 50.3 11.8 


* By husbands’ and wives’ reports of their own preferred frequencies. Marital partners 
are classified as having the same preference if they did not differ by more than 2. 
Husbands (and wives) are classified as higher if they exceeded their spouses by 3 or more. 


higher than their own by three or more. Class C, 1. the percentage of cases 
in which the husbands imputed to their wives a preferred frequency higher 
by three or more than husbands reported for themselves; 2. the percentage 
of cases in which the wives reported their own preferred frequency as higher 
by three or more than their estimates of their husbands’ preference. These 
percentages, designated as confirmation percentages, are given in Table 2. 
The percentages for the three classes of couples indicate the extent to which 
husbands’ and wives’ estimates of their marital partners’ preferred frequency 
relative to their own are in accord with the classification of couple members 
(as having the same preference, husband’s preference higher or wife’s prefer- 
ence higher) based on their own reports of their preferred frequency of coitus. 


The differences between the three classes of couples revealed in Table 2 
s point to the same conclusion as that derived from the comparison of the 
2 group means of Table 1. The confirmation percentages are lowest by far for 
¥ the husbands and wives who constitute the Class C couples.!° By the reports 
u of the husbands and wives of their own preferred frequency of coitus Class 
aoe C couples are marriages in which the wife’s preferred monthly frequency 
4 exceeds that of the husband by three or more. But only 14.5 per cent of the 
a Class C men and 11.8 per cent of the women so represent the preferred fre- 
3 quency of their marital partners relative to their own. The remaining wives 
ree predominantly (76 per cent) report the preferred frequency of their husbands 
as as the same as their own. The remaining husbands (75 per cent) likewise 


impute to their marital partners the same preferred frequency as they report 
for themselves. 


10 The differences between (a) 14.5 and 52.3, (b) 52.3 and 77.8, (c) 11.8 and 50.3, 
and (d) 50.3 and 71.4, are significant at the .01 level. 
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The size of the confirmation percentages for Class B persons argues for 
the hypothesis of the study that if couple members differ in sex drive it is 
culturally acceptable to husbands and wives for the former to have the 
stronger drive. It is interesting to note, however, that, except for a small 
proportion (7 per cent), those Class B husbands who do not report their 
preferred frequency as higher than that of their wives (i.e., the nonconfirming 
cases) impute to the latter a preference equal to their own. And all but one 
of the wives who do not report their husbands as having a higher preferred 
frequency than their own represent them as having a preferred rate of coitus 
equal to their own. There is thus an indication in the behavior of almost half of 
the Class B husbands and wives that the norm influencing their representation 
of their marital partners’ drive relative to their own is that the sexes are more 
or less equal in sex drive. 

This norm is most evident in the behavior of Class A husbands and wives 
who by their self-reports are equal in sex drive as measured by their pre- 
ferred frequency of coitus. Seventy-eight per cent of the husbands so report 
their preferred frequency relative to their wives. And 71 per cent of the wives 
so represent their preference relative to that of their husbands. The persons 
in Class A not embraced by the confirmation percentages tend to rate the 
husbands as having a higher preferred frequency than the wives. 

The evidence adduced in Tables | and 2 is therefore consistent with the 
assumption tested by this study that a cultural norm is operative which makes 
acceptable to marital partners an equality of sex drive in husband and wife 
or a stronger drive in the former but which makes unacceptable a stronger 
drive in the latter. This finding derives from data obtained from a group of 
young couples in the early years of marriage who were predominantly native 
born, urban, college-level Protestants. Although the husbands and wives were 
volunteer participants in the study, there is no reason to believe that the 
same norm would not be operative among couples having similar social 
characteristics and, more generally perhaps, among couples subscribing to 
the equalitarian concept of marriage. 
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